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ABSTRACT 



The DNA of the invention are characterised in that they 
concern the whole or part of genes, with their reading frame, 
to be found in Neisseria meningitidis, but not in Neisseria 
gonorrhoeae, or in Neisseria lactamica except the genes 
involved in the biosynthesis of the polysaccharide capsule, 
frp A, frp C, opc, por A, rotamase the sequence IC1106, IgA 
protease, pilline, pilC, transferrin binding proteins and opac- 
ity proteins. The invention also concerns the polypeptides 
corresponding to these DNA and the antibodies directed 
against these polypeptides. It is applicable in the prevention 
and the detection of meningococcus induced infections and 
meningitis. 
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DNAS AND PROTEINS OR PEPTIDES 
SPECIFIC TO BACTERIA OF THE SPECIES 
NEISSERIA MENINGITIDIS, PROCESSES FOR 
OBTAINING THEM AND THEIR 

BIOLOGICAL USES 5 

The present application is a continuation of application 
Ser. No. 09/214,759, filed Apr. 22, 1999 now abandoned, 
whichisa371 application of PCT/FR97/01 295, filed Jul. 11, 
1 997, and claims benefit of FR 96 08768, filed Jul. 12, 1 996. lo 

DNAs and proteins or peptides specific to bacteria of the 
species Neisseria meningitidis, processes for obtaining them 
and their biological uses. 

The invention relates to DNAs and to proteins and 
peptides which are specific to bacteria of the species Nets- 15 
seria meningitidis (abbreviated below to Nm), to the process 
for obtaining them and to their biological uses, in particular 
for the prevention and detection of meningococcal infections 
and meningitis. 

It is known that Nm is one of the main agents of 20 
cerebrospinal meningitis. 

Studies conducted in the United States have shown that 
5 to 10% of the population are asymptomatic carriers of the 
Nm strain(s). The transmission factors of Nm are poorly 
known. For a proportion of persons infected, Nm penetrates 25 
the bloodstream, where it can cause meningococcaemia 
and/or progress to the cerebrospinal stream, to cause men- 
ingitis. Without fast antibiotic treatment, the infection can 
develop like lightning and become fatal. 

Compared with other pathogens, Nm has the character- 30 
istic of being able to cross the haemato-encephalic barrier to 
colonize the meninges. The study of the pathogenicity of 
Nm is therefore important not only in the context of 
meningitis, but also in the context of any disease which 
affects the brain. 35 

The benefit of having available tools specific to this 
species of bacteria for the uses envisaged above is therefore 
understood. 

Genetically, Nm is very close to bacteria of the species 
Neisseria gonorrhoeae (abbreviated to Ng below) and of the 40 
species Neisseria lactamica (abbreviated to Nl below). 
However, their pathogenicity is very different. 

Nm colonizes the nasopharynx, and then crosses the 
pharyngeal epithelium to invade the submucous space, thus 
being responsible for septicaemia and meningitis. 45 

Ng is especially responsible for infections located in the 
genitourinary tract. It colonizes the genital mucosa, and then 
crosses the epithelium, subsequently invading the 
subepithelium, where it multiplies and is responsible for a 
severe inflammatory reaction. Disseminated gonococcal 50 
infections are possible, but remain rare and are the result of 
only some strains. 

As regards Nl, it is considered that this is a non-pathogenic 
strain, since it is not responsible for a localized or general 
invasion. 55 

A first consideration thus led to taking into account the 
fact that Nm and Ng, while being bacteria very close to one 
another, have different pathogenic potencies. 

Since the genome of these bacteria has a high homology, 
only limited parts of the genome of Nm and Ng must code 60 
for specific virulence factors responsible for their pathogen- 
esis. 

It is clear that Nm has, compared with Ng, DNA 
sequences which are specific to it and which must be 
involved in the expression of its specific pathogenic potency. 65 

The species Nm is subdivided into serogroups based on 
the nature of the capsular polysaccharides. 
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At least 13 serogroups have been defined, among wliich 
serogroups A, B and C are responsible for about 90% of 
meningitis cases. Groups A and C are found in epidemic 
forms of the disease. Group B is the serogroup generally 
isolated the most in Europe and the United States. 

The capsule, which is present in Nm and absent fi*om Ng, 
has served as the basis for formulating meningococcal 
antimeningitis vaccines. 

The polysaccharides of the Nm capsule have been used to 
formulate a vaccine which has proved to be effective in 
preventing in adults the meningitis caused by meningococci 
of serogroups A, C, W135 and Y. 

However, the polysaccharide of Nm group C has proved 
to be weakly immunogenic in children of less than two 
years, while the polysaccharide of Nm group B is non- 
immunogenic in man and shares epitopes with adhesion 
glycoproteins present in human neuronal cells. 

There is therefore no universal vaccine capable of pre- 
venting infections caused by all the serogroups of the 
meningococci and capable of responding to the intrinsic 
antigenic variability of bacterial pathogens in general and 
Nm in particular. 

Because of the cross -reactivity of the Nm group B 
polysaccharide with human antigens, the multiplicity of the 
serogroups and the antigenic variability of Nm, the strate- 
gies proposed to date cannot lead to a vaccine which is 
effective in all situations. 

Research is therefore concentrated on study of the char- 
acteristic elements responsible for the specificity of the 
meningococcal pathogenesis. 

The majority of genes which have been studied in either 
of the two bacteria Nm or Ng have their homologue in the 
second bacterium. 

In the same way, the majority of virulence factors iden- 
tified to date in Nm have a counterpart in Ng, that is to say 
pilin, the PilC proteins, the opacity proteins and the recep- 
tors of lactoferrin and transferrin. 

The specific attributes of meningococci characterized in 
the prior art are the capsule, the Frp proteins analogous to 
RTX toxins, Opc proteins of the external member, glu- 
tathione peroxidase, the porin PorA and the rotamase gene. 

Among these, only the capsule is invariably present in the 
virulent strains of Nm. However, several extracellular patho- 
gens have a capsule without nevertheless crossing the 
haemato-encephalic barrier. 

Attributes which have not yet been identified must there- 
fore be responsible for the specificity of the meningococcal 
pathogenesis. These attributes are probably coded by DNA 
sequences present among the meningococci but absent from 
the gonococci. 

The inventors have developed a new approach based on 
subtractive isolation of Nm-specific genes, which genes 
must be linked to the specific pathogenesis of Nm, and more 
particularly to crossing of the haemato-encephalic barrier. 

The subtractive method developed in the prior art has 
resulted in the production of epidemological [sic] markers 
for some Nm isolates. These markers are of limited use: they 
do not cover all the serogroups of the Nm species. 

In contrast to these studies, the work of the inventors has 
led, by confronting Nm with the entire Ng chromosome 
sheared in a random manner, to the development of a means 
for cloning all the DNAs present in Nm and absent firom Ng, 
thus providing tools of high specificity with respect to Nm, 
and thus enabling the genetic variability of the species to be 
responded to for the first time. 

The terms "present" and absent" used in the description 
and claims in relation to the DNAs of a strain or their 
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expression products are interpreted on the basis of identical grouped in distinct regions, which are said to correspond to 

hybridization conditions (16 h at 65° C, with NaP04 0.5 M, the pathogenicity islets described previously for E. coli and 

pH 7.2; EDm-Na 0.001 M, 1%, 1% bovine serum albumin Y pestis, 

and 7% sodimn dodecyl sulphate) using the same probe and In a preferred embodiment of the invention, these DNA 

the same labelling intensity of the probe, the same amount 5 are thus also characterized in that they comprise one or more 

of cliromosomal DNA and the same comparison element sequence(s) present on the chromosome of Neisseria men- 

(chromosomal DNA of the homologous strain). ingitidis Z2491 between tufA and pilT, or region 1 of the 

It is therefore considered that the DNA is present if the chromosome, and/or the sequence(s) capable of hybridizing 

signal obtained with the probe is practically the same as that with the above sequence(s), with the proviso of being 

obtained with the reference strain. specific to Neisseria meningitidis. 

Conversely, it is considered that the DNA is absent if this "Specific" in the description and the claims means the 

signal appears very weak. nucleotide sequences which hybridize only with those of 

A second consideration of the pathogenicities of Nm and Nm under the hybridization conditions given in the 

Ng leads to taking into account their common capacity for examples and referred to above. 

colonization and penetration of the mucosa, and then inva- In this respect, it can be seen that, in a general manner, 

sion of the subepithelial space of the latter. It is highly when "all or part" of a sequence is referred to in the 

probable that this process involves virulence factors com- description and claims, this expression must be interpreted 

mon to the two pathogens. In this respect, it is known that with respect to the specificity defined above. 

a certain number of virulence factors have already been Furthermore, all or part of a peptide or a fragment of a 

identified in Nm and in Ng, such as the pili proteins, PilC, peptide or an antibody means a product having the biologi- 

the opacity proteins, the IgA proteases, the proteins for 20 cal properties respectively of the natural peptide or the 

binding to transferrin and to lactoferrin, and the lipooli- antibody formed against the peptide. 

go saccharides. Genes of the Neisseria meningitidis capsule are grouped 

The approach of the inventors is thus extended to inves- in region 1. 

tigation of the Nm regions which are specific to Nm and Ng DNAs of this type have a sequence corresponding in all 

but absent from the non-pathogenic species Nl, and in a 25 or part to SEQ ID No. 9, 13, 22 or 30, and/or to any sequence 

general manner to investigation of the chromosomal regions located at more or less 20 kb fi*om these SEQ ID on the 

of the DNAs and their expression products specific to a chromosome of an Nm strain, and/or have a sequence which 

given species by the means developed in accordance with is capable of hybridizing with at least a fragment of any one 

the invention. of these sequences. 

The object of the invention is thus to provide DNAs of 30 In another preferred embodiment of the invention, these 

Nm specific to its pathogenic potency and means for obtain- DNA are also characterized in that they are made up of one 

ing them, in particular by formulating banks formed exclu- or more sequence(s) present on the chromosome of Neis- 

sively fi*om these Nm-specific DNAs. seria meningitidis 72A9\ between pilQ and X740, or region 

It also provides the products derived from these DNA 2 of the chromosome, and/or the sequences(s) capable of 

sequences. 35 hybridizing with the above sequence(s), with the proviso of 

The invention also relates to the utilization of specific and being specific to Neisseria meningitidis. 

exhaustive characteristics of these banks to formulate tools DNAs according to this embodiment have a sequence 

which can be used, in particular, in diagnostics, treatment corresponding in all or part to SEQ ID No. 1,2, 4, 6, 7, 10, 

and prevention. 15, 31 or 34, and/or to any sequence located at more or less 

Tlie DNAs of the invention are characterized in that they 40 20 kb from these SEQ ID on the cliromosome of an Nm 

are in all or part genes, with their reading frame, present in strain, and/or have a sequence which is capable of hybrid- 

Neisseria meningitidis, but absent either from Neisseria izing with at least a fragment of any one of these sequences. 

gonorrhoeae and from Neisseria lactamica, with the excep- The invention especially provides all or part of the DNA 

tion genes involved in the biosynthesis of the polysaccharide sequence SEQ ID No. 36 of 15,620 bp, and the sequences 

capsule, fi:pA, firpC, opc, por A, rotamase, the sequence 45 corresponding to the open reading frames SEQ ID No. 37, 

IS1106 (Accession No. Z 11857 in the EMBL/GenBank/ SEQ ID No. 38, SEQ ID No. 39, SEQ ID No. 40, SEQ ID 

DDBJ Nucleotide Sequence Data Libraries; see Knight et No. 41, SEQ ID No. 42, SEQ ID No. 43, SEQ ID No. 44 and 

al., 1992, Molecular Microbiology 6(11): 1565-1573), IgA SEQ ID No. 45. 

proteases, pilin, pilC, proteins which bind transferrin and In yet another preferred embodiment of the invention, 

opacity proteins. 50 these DNAs are also characterized in that they are made up 

As stated above, the terms "present" and "absenf are of one or more sequence(s) present on the chromosome of 

interpreted on the basis of the hybridization conditions used Neisseria meningitidis Z2491 between argF and opaB, or 

in the Southern blotting described in the examples and region 3 of the chromosome, and/or the sequence(s) capable 

referred to above. of hybridizing with the above sequence(s), with the proviso 

It can be seen that these DNAs include variants where 55 of being specific to Neisseria meningitidis. 

these express a fimction intrinsic to the Nm species, more DNAs according to this embodiment are characterized in 

particularly a phenotype found solely in Nm or in conmion that they have a sequence corresponding in all or part to SEQ 

exclusively with Ng. ID No. 8, 21, 23, 25, 26, 28, 29, 32 or 35, and/or to any 

According to a main aspect, these DNAs are specific to sequence located at more or less 20 kb from these SEQ ID 

the pathogenicity of Neisseria meningitidis, in spite of the 60 on the chromosome of anNm strain, and/or have a sequence 

genetic variability of this species. which is capable of hybridizing with at least a fragment of 

According to an embodiment of the invention, the said any one of these sequences. 

DNAs are specific to Nm, in contrast to Ng. Regions 1, 2 and 3 identified above have a high proportion 

More particularly, the Nm-specific DNAs are absent from of sequences specific to Neisseria meningitidis and also fall 

Neisseria lactamica and from Neisseria cinerea. 65 within the context of the invention. 

Surprisingly, the majority of genetic differences between Other DNAs representative of the specificity with respect 

the strains of meningococci and those of gonococci appear to Neisseria meningitidis have one or more sequences which 
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is/are present on the cliromosome of Neisseria meningitidis 
Z2491 but are not part of regions 1, 2 and 3 defined above. 

Such DNAs comprise one or more sequence(s) corre- 
sponding in all or part to SEQ ID No. 3, 5, 11, 12, 14, 16, 
18, 19, 20, 24, 27 or 33, and/or to any sequence located at 5 
more or less 20 kb from these SEQ ID on the chromosome 
of an Nm strain, and/or have a sequence capable of hybrid- 
izing with such sequences. 

Taking into account the uses envisaged in particular, the 
invention more specifically relates to the above DNAs 10 
involved in the pathogenesis of the bacterial organism. 

In particular, it provides the DNAs corresponding to at 
least one of the characterizations given above and coding for 
a protein exported beyond the cytoplasmic membrane, and/ 
or of which all or part of their sequence corresponds to the 15 
conserved region of the said DNAs. 

According to another embodiment of the invention, the 
DNAs are thus common with those of Ng, but are absent 
from those of Nl. 

These are more specifically the DNAs which are present 20 
on region 4 (arg J to reg F) or on region 5 (lambda 375 
marker to pen A) on the chromosome of Nm Z2491 and/or 
are capable of hybridizing with the said DNAs present, with 
the proviso of being specific to Nm and Ng, in contrast to Nl. 

"Specific to Nm and Ng in contrast to Nl" means the 25 
DNAs which hybridize with the DNAs of Nm and Ng under 
the hybridization conditions of the examples (see example 4 
in particular). 

The DNAs of regions 4 and 5 and those capable of 
hybridizing with these DNAs, with the proviso of expressing 30 
the intrinsic fiinctions of Nm, have the advantage of inter- 
vening in a significant manner in the virulence of Nm, being 
involved in the stage of initial colonization and penetration 
and in the septicaemic dissemination. 

According to other embodiments, the invention provides 35 
transfer and expression vectors, such as plasmids, cosmids 
or bacteriophages, comprising at least one DNA as defined 
above. 

It also provides host cells transformed by at least one 
DNA as defined above. 40 

Other host cells of the invention comprise genes or gene 
fragments specific to Nm, and are characterized in that their 
chromosome is deleted by at least one DNA according to the 
invention, in particular a DNA responsible for the pathoge- 
nicity. They are more specifically bacterial cells, in particu- 45 
lar of Nm. 

The invention also relates to the RNAs of which the 
sequence corresponds in all or part to the transcription of at 
least one DNA sequence or sequence fragment as defined 
above. 50 

The invention also relates to the antisense nucleic acids of 
the DNAs as defined above, or of fragments of these DNAs. 

These antisense nucleic acids carry, where appropriate, at 
least one substituent, such as a methyl group and/or a 
glycosyl group. 55 

Other products which fall within the context of the 
invention include polypeptides. 

These polypeptides are characterized in that they have an 
amino acid chain corresponding to all or part of a sequence 
coded by the nucleic acids defined above, or deduced from 60 
sequences of these nucleic acids. 

They are advantageously polypeptides corresponding to 
all or part of the polypeptides exported beyond the cyto- 
plasmic membrane, more specifically polypeptides corre- 
sponding to all or part of those coded by a conserved region. 65 

As a variant, the polypeptides of the invention can be 
modified with respect to those corresponding to the nucleic 
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acid sequences such that they are particularly suitable for a 
given use, in particular use as a vaccine. 

Modification is understood as meaning any alteration, 
deletion or chemical substitution where this does not affect 
the biochemical properties of the corresponding natural 
polypeptides, more specifically of functional proteins 
exported at the periplasm and the external membrane. 

Other products according to the invention include anti- 
bodies directed against the above polypeptides. 

The invention thus provides polyclonal antibodies, and 
also monoclonal antibodies, characterized in that they rec- 
ognize at least one epitope of a polypeptide as described 
above. 

It also relates to fragments of these antibodies, more 
particularly the fragments Fv, Fab and Fab'2. 

The invention also relates to the anti -antibodies which are 
capable of recognizing the antibodies defined above, or their 
fragments, by a reaction of the antigen- antibody type. 

According to the invention, the various products consid- 
ered above are obtained by a synthesis and/or biological 
route in accordance with conventional techniques. 

The nucleic acids can also be obtained from banks made 
up of Nm-specific DNAs such as are formulated by a 
subtractive technique, this technique comprising: 

mixing of two DNA populations, 

realization of at least one subtractive hybridization- 
amplification iteration, and 

collection of the desired DNA or DNAs, followed, where 
appropriate, by its/their purification with elimination of 
redundant sequences. 

According to the invention, the two DNA populations 
originate respectively from a strain of Neisseria 
meningitidis, the so-called reference strain for which the 
specific bank must be constructed, and a strain of Neisseria, 
the so-called subtraction strain, having a homology in pri- 
mary DNA sequences of greater than about 70% with the 
Neisseria meningitidis strain, the DNA sequences of the 
subtraction and reference strains being obtained respectively 
by random shearing, and by cleavage by a restriction endo- 
nuclease capable of producing fragments less than about 1 
kb in size. 

The invention provides in particular a process for obtain- 
ing Neisseria meningitidis-specific DNA banks, comprising 
the stages of 

random shearing of the chromosomal DNA of a strain of 
Neisseria gonorrhoeae, the so-called subtraction strain, 
in particular by repeated passage through a syringe, 

cleavage of the cliromosomal DNA of a strain of Neis- 
seria meningitidis, the so-called reference strain, pref- 
erably by a restriction enzyme producing fragments 
less than about 1 kb in size, 

splicing of the DNA fragments of the reference strain, 
cleaved by the restriction enzyme, with suitable oligo- 
nucleotide primers, 

realization of a subtractive hybridization-amplification 
iteration, by: 

mixing of the two DNA populations under suitable 
conditions for hybridization of homologous 
sequences, and then 

amplification of auto-reannealed fragments and collec- 
tion of these fragments, 

digestion of these fragments by a restriction enzyme 
and re-splicing with oligonucleotide primers, fol- 
lowed by a 

purification of the spliced DNA and, where appropriate, a 
new iteration of the subtractive hybridization, compris- 
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ing mixing of DNA fragments of Neisseria gonor- 
rhoeae sheared as indicated above with DNA fragments 
of Neisseria meningitidis produced by the preceding 
iteration, fohowed, if desired, by cloning of the DNAs 
of the bank. 5 

The primers used are oligodeoxynucleotide primers 
which are suitable for the restriction endonuclease used and 
allow insertion into a cloning site, such as the EcoRI site of 
the plasmid pBluescript. Such primers will advantageously 
be chosen among the oligodeoxynucleotides referred to in 10 
the sequence listing under SEQ ID no. 36 to 45. 

The banks thus obtained are formed from DNAs which 
are specific to meningococci and absent from gonococci. 

The specificity of the DNAs was verified, as described in 
the examples, at each iteration by Southern blots, with genes 15 
common to the subtraction strain and to the reference strain, 
or with the total DNA of each of the strains digested by a 
restriction endonuclease, such as Clal. 

At each iteration, the exhaustivity of the DNA bank was 
also verified by Southern blotting with probes known to be 20 
specific to the reference strain, that is to say for Neisseria 
meningitidis the fi:p, opc and rotamase genes in particular. 

The experiments carried out showed that the banks 
obtained by the process of the invention are deficient in 
genes having a significant homology with species of Neis- 25 
seria other than Neisseria meningitidis, for example the ppk 
or pilCl genes, generally in only 2 or 3 iterations. 

If necessary, two routes, which are not exclusive of each 
other, can be taken. 

It is possible to proceed with an (n+ 1)^^ iteration using the 30 
DNA of iteration n as the DNA population of the reference 
strain. 

As a variant, a second bank independent of the first is 
constructed, with a restriction enzyme of different specificity 
to that used in the first bank, for example Mbol. 35 

In all cases, it is preferable to keep each of the products 
produced by each of the iterations performed. 

The invention also provides the use of the subtractive 
technique described above to obtain banks of the DNAs 
common to Nm and Ng, but specific with respect to Nl. 40 

Three different banks are advantageously constructed, 
two of them by digestion of the cliromosomal DNA of Nm 
by Mbol and Tsp5091, and the third by digestion of the 
chromosomal DNA of Nm with Mspl. Two subtraction 
series allow the DNAs having the required specificity to be 45 
collected, as described in the examples. 

The invention also relates to the process for obtaining 
these banks and the banks themselves. 

It can be seen that, generally, the process of the invention 
can be used to obtain banks of DNAs specific to a given cell 50 
species, or to a given variant of the same species, where 
another species or another variant which is close genomi- 
cally and expresses different pathogenic potencies exists. 

Using the process of the invention, DNA banks specific to 
given species of cryptococci, Haemophilus, pneumococci or 55 
also Escherichia coli, or more generally any bacterial agent 
belonging to the same species and having different pathovars 
will advantageously be constructed. 

Furthermore, from these banks the invention provides the 
means to have available virulence factors specific to a 60 
species or a given variant. 

Such banks are therefore tools which are of great interest 
for having available attributes which are responsible for the 
specificity of a pathogen, this use being more specifically 
illustrated according to the invention by the obtaining of 65 
banks comprising the attributes responsible for the specific- 
ity of the meningococcal pathogenesis. 
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Study of the products of the invention, the nucleic acids, 
polypeptides and antibodies, has enabled an absolute speci- 
ficity with respect ot Neisseria meningitidis, regardless of 
the strain and its variability, to be demonstrated. 

These products are therefore particularly suitable for 
diagnosis or prevention of infections and meningitis caused 
by Neisseria meningitidis, whether in adults or children and 
regardless of the serogroups of the strain in question. 

The method for diagnosis, according to the invention, of 
a meningococcal infection, and more particularly of men- 
ingococcal meningitis, by demonstration of the presence of 
Neisseria meningitis in an analytical sample is characterized 
by the stages of: 

bringing into contact a sample to be analysed, that is to 
say a biological sample or a cell culture, and a reagent 
formulated from at least one nucleic acid as defined 
above, if appropriate in the form of a nucleotide probe 
or a primer, or, as a variant, from at least one antibody 
or a fragment of an antibody as defined above, under 
conditions which allow, respectively, hybridization or a 
reaction of the antigen-antibody type, and 

detection of any reaction product fomied. 

If the reagent is formulated from a nucleic acid, this can 
be in the form of a nucleotide probe in which the nucleic acid 
or a fragment of the latter is labelled in order to enable it to 
be detected. Suitable markers include radioactive, 
fluorescent, enzymatic or luminescent markers. 

As a variant, the nucleic acid is included in a host cell, 
which is used as the reagent. 

In these various forms, the nucleic acid is used as such or 
in the form of a composition with inert vehicles. 

If the reagent is compiled from an antibody, or a fragment 
of an antibody, this can be labelled for detection purposes. 
Most generally, a fluorescent, enzymatic, radioactive or 
luminescent marker is used. 

The antibody or the antibody fragment used, which is 
labelled if appropriate, can be used as such or in the form of 
a composition with inert vehicles. 

The sample used in the stage of bringing the components 
into contact is a biological sample produced by a mammal, 
such as cephalorachidian fluid, urine, blood or saliva. 

The detection stage is carried out under conditions which 
allow the reaction product to be demonstrated when it is 
formed. Conventional means use fluorescence, 
luminescence, colour or radioactive reactions, or also auto- 
riadography [sic] techniques. It is also possible to quantify 
the product. 

The invention also relates to the labelled products, the 
nucleic acids and antibodies, as new products. 

The method defined above can be used for diagnosis of an 

immune reaction specific to a meningococcal infection. 

The reagent used is thus a polypeptide according to the 
invention, as coded by the said nucleic acid sequences, 
corresponding to the natural product or a polypeptide which 
is modified but has the biological and immunological activ- 
ity of the corresponding natural polypeptide. 

It is advantageously a polypeptide exported beyond the 
cytoplasmic membrane of Neisseria meningitidis, more par- 
ticularly the part of such a polypeptide corresponding to the 
conserved region of the DNA. 

The invention also relates to kits for carrying out the 
methods defined above. These kits are characterized in that 
they comprise: 

at least one reagent as defined above, that is to say of the 

nucleic acid, antibody or polypeptide type, 
products, in particular markers or buffers, which enable 
the intended nucleotide hybridization reaction or 
immunological reaction to be carried out, as well as use 
instructions. 
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Hie specificity of the products of the invention and their 
location on the cliromosome of Neisseria meningitidis 
Z2491, either grouped in a region and able to be interpreted 
as pathogenicity islets, or isolated on the cliromosome, 
impart to them a very particular interest for realization of 
vaccine compositions with a universal purpose, that is to say 
whatever the strain and the variability which it expresses. 
These compositions can include in their spectrum other 
prophylaxes, and can be, for example, combined with child- 
hood vaccines. 

The invention thus provides vaccine compositions which 
include in their spectrum antimemngococcal prophylaxis, 
intended for prevention of any infection which may be 
caused by Neisseria meningitidis, these compositions being 
characterized in that they comprise, in combination with (a) 15 
physiologically acceptable vehicle(s), an effective amount of 
polypeptides or anti -antibodies or their fragments as defined 
above, these products optionally being conjugated, in order 
to reinforce their ironiogenicity [sic]. 

Immunogenic molecules which can be used comprise the 20 
polio virus protein, the tetanus toxin, or also the protein 
produced by the hypervariable region of a pilin. 

As a variant, the vaccine compositions according to the 
invention are characterized in that they comprise, in com- 
bination with (a) physiologically acceptable vehicle(s), an 25 
effective amount: 

of nucleic acids as defined above, 

of transformed host cells as defined above, or 

of Nm cells, the chromosome of which has been deleted 
by at least one DNA sequence according to the invention 
involved in the pathogenicity of the bacterium. The nucle- 
otide material used is advantageously placed under the 
control of a promoter of its expression in vivo and synthesis 
of the corresponding protein. To reinforce the 
immunogenicity, it is also possible to combine this nucleic 
material with a DNA or an RNA which codes for a carrier 
molecule, such as the poliovirus protein, tetanus toxin or a 
protein produced by the hypervariable region of a pilin. 

The vaccine compositions of the inventions can be admin- 
istered parenterally, subcutaneously, intramuscularly or also 
in the form of a spray. 

Other characteristics and advantages of the invention are 
given in the examples which follow for illustration thereof, 
but without limiting its scope. 
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FIGS. 9, 10 and 11: the reactivity of clones of the 3 
subtractive banks with respect to Nm, Nl and Ng. 

In the examples which follow, the following materials and 
methods were used: 

Bacterial strains — To obtain the subtractive banks, strain 
Z2491 of Nm (Achtman et al., 1991, J. Infect Dis. 164, 
375 382), the strains MSll (Swanson et al., 1974, Infect. 
Immun. 10, 633-644) and the strains 8064 and 9764 of Nl 
were used, it being understood that any other strain of the 
species in question could be used. 
In order to verify the specificity of these banks, 6 strains 

of Nm, 4 strains of Ng, one strain of Nl {Neisseria 

lactamica) and one strain of Nc (Neisseria cinerea) were 

used. 

The six strains of Nm are: Nm Z2491 of serogroup A, Nm 
8013 of serogroup C QW collection), Nm 1121, no sero- 
grouping possible (XN collection), Nm 1912 serogroup A 
(XN collection), Nm 7972 of serogroup A pCN collection) 
and Nm 8216 of serogroup B (XN collection). 

The four strains of Ng are: Ng MS 1 1 (Pasteur Institute, 
Paris), Ng 403 (Pasteur Institute, Paris), Ng 6934 (Pasteur 
Institute, Paris), Ng WI (isolated fi*om a disseminated gono- 
coccal infection), Ng 4C1, Ng 6493 and Ng FA 1090. 

The strains of Nl are Nl 8064 and Nl 9764 (XN 
collection), and that of Nc is Nc 32165 (XN collection). 
Molecular Genetics Techniques 

Unless indicated otherwise, the tecliniques and reagents 
used correspond to those recommended by Sambrook et al 
(Sambrook et al 1989, Molecular Cloning: A Laboratory 
Manual. Cold Spring Harbor Laboratory Press). The oli- 
godeoxynucleotides used in this study are: 
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BRIEF DESCRIPTION OF THE SEVERAL 
VIEWS OF THE DRAWING(S) 

In these examples, reference will be made to FIGS. 1 to 
11, which show, respectively, 

FIGS. lA, IB, IC, ID, IE, IF and IG: analysis of the 
subtractive bank Tsp5091, 

FIG. 2: the distribution of the Nm-specific sequences, in 
contrast to Ng, on the chromosome of the strain Z2491 
(left-hand part) and of Nm-specific sequences, in contrast to 
Nl (right-hand part), 

FIGS. 3A to 3C: the reactivity of the clones of the 3 
regions of the chromosome according to the invention 
towards a panel of strains of the genus Neisseria, 

FIG. 4: the position in region 2 of the cliromosome of Nm 
of oligonucleotides used as probes, 

FIGS. 5, 6 and 7: the Southern blots of a panel of strains 
of the genus Neisseria, using parts of region 2 of Nm as 
probes, 

FIGS. 8 A to 8C: the Southern blots with 3 subtractive 
banks over a panel of 12 strains of Neisseria, and 
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(SEQ ID No. 54) 


RBAml2 , 


3' 


AGTGGCTCCTAG 54 






(SEQ IN No. 55) 


RBam24, 


5' 


AGCACTCTCCAGCCTCTCACCGAG 3 ' ; 






(SEQ ID No. 60 ) 


Jbaml2 , 


3' 


GATCCGTTCATG 5 ' ; 






(SEQ ID No. 61) 


JBAM24, 


5' 


ACCGACGTCGACTATCCATGAACG 3 ' ; 






(SEQ ID No. 56) 


REcol2 , 


AGTGGCTCTTAA; 






(= RBam 24, SEQ ID NO. 55) 


REco24 , 


5 ' 


AGCACTCTCCAGCCTCTCACCGAG 3 ' ; 






(SEQ ID No. 62) 


JEcol2 , 


GTACTTGCTTAA; 






(= JBam 24) 


JEco24, 


5' 


ACCGACGTCGACTATCCATGAACG 3 ' ; 






(SEQ ID No. 64) 


NEcol2, 


AATTCTCCCTCG; 






(SEQ ID No.65) . 


NEco24, 


AGGCAACTGTGCTATCCGAGGGAG ; 



50 



Transfer to Membranes (Southern Blots) 

The transfers to membranes were eflected by capillary 
transfers to positively charged nylon membranes 
(Boehringer Mannheim). The hybridizations were carried 
out at 65° C. in a solution comprising NaPi [sic] 0.5 M pH 
7.2/EDTA 1 niM/SDS 7%/BSA 1%. The membranes were 
washed in a solution comprising NaPi [sic] 40 mM pH 
7.2/EDTA 1 mM/SDS 1%. The final washing was carried out 
at 65° C. for 5 min. 

The probe frp obtained with oligonucleotides based on the 
fipA sequence corresponds to 2.4 kb of the 5' end of the gene 
of the strain Z2491. The opc and rotamase probes corre- 
sponding to whole genes are produced from the strain Z2491 
using oligonucleotides constructed on the basis of published 
sequences. The probes pilCl and ppk (polyphosphate kinase) 
correspond to inserts of the plasmids pJLl and 
pBluePPK6001 respectively. 
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EXAMPLE 1 method are labelled and used as a probe for the DNA 

^ . nx^i (-T^^T^T^ .-VT 1 digested by Clal produced from a panel of six strains of 

Construction of Banks of DNAs Present m Nm and Neisseria meningitidis, four of Neisseria gonorrhoeae, one 

Absent from Ng of Neisseria lactamica and one of Neisseria cinerea. 

a. "Mbol" Bank 5 The Southern blots obtained show that the amplified 

Construction — ^The DNA of Nm Z2491 was cleaved by sequences resulting from the second iteration of the CDA 

the endonuclease Mbol and subjected to two iterations of a method have a high reactivity with several bands corre- 

method called CDA (comprehensive difference analysis) sponding to meningococci, and do not have a reactivity with 

below. This method comprises subtractive hybridization in the bands corresponding to the Ng, Nl and Nc strains, 
the presence of excess sheared DNA of Ng MSll and lo The "MboF' bank thus appears to be Nm-specific. 

amplification by PCR of those meningococcal sequences Exhaustivity — In order to test the exliaustivity of the 

which, since they are absent from or do not have significant bank, all the products produced from the first and second 

homology with the DNA of Ng MSll, could reanneal. iterations of the CDA method and also the initial chromo- 

The chromosomal DNA of the strain Ng MSll is sheared somal materials of Nm Z2481 [sic] and Ng MSll are 

randomly by repeated passage through a hypodermic syringe subjected to agarose gel electrophoresis, transferred to a 

until fi-agments of a size ranging from 3 to 10 kb are membrane and brought into contact with probes comprising 

obtained. These DNA fragments are purified by extraction genes known to be meningococcus- specific, that is to say 

with phenol. frp, opc and rotamase (Southern blotting). 

The chromosomal DNA of the strain Nm Z2491 is itself As a result of these hybridizations, the Nm-specific gene 

cleaved by the restriction endonuclease Mbol. These DNA frp is represented in the Mbol bank by a fragment of 600 bp, 

fragments (20 ^g) are spliced with 10 nmol of annealed but no activity is observed for the rotamase and opc genes, 

oligonucleotides RBaml2 and RBam24. The excess primers The Mbol bank, although Nm-specific, therefore cannot be 

are removed by electrophoresis over 2% agarose gel of low considered exhaustive. 

melting point. The part of the gel containing amplified Given their high specificity, the fragments produced by 

fragments greater than 200 bp in size is excised and digested the second iteration of the CDA method for the Mbol bank 

by p-agarase. These fragments are purified by extraction can nevertheless be cloned on the BamHI site of the plasmid 

with phenol. pBluescript. 

To carry out a subtractive hybridization (first iteration), A sequence corresponding to any of the Nm-specific 

0.2 |xg of the Nm DNA spliced with the RBam oligonucle- genes can be included in the subtractive bank only if it is 

otides is mixed with 40 |xg Ng DNA in a total volume of 8 carried by a restriction fragment of appropriate size. This 

ml of a buflFer EE 3x (a buffer EE Ix is composed of condition is a ftinction of two factors. Firstly, the probability 

N-(2-hydroxyethyl)piperazine-N'-(3-propanesulphonic that the largest fragments are entirely Nm-specific is low. 

acid) 10 mM and EDTA 1 mM, and its pH is 8.0). This Secondly, even if such fragments existed, they would be 

solution is covered with mineral oil and the DNA is dena- under-represented in the bank because of the limitations of 

tured by heating at 100° C. for 2 min. 2 \\\ NaCl 5 M are the PGR teclinique, the amplification effectiveness of which 

added and the mixture is left to hybridize at 55° C. for 48 h. decreases with increasing size of the fragments. Fragments 

The reaction mixture is diluted to 1/10 in a preheated greater that about 600 bp in size are not included in the bank, 

solution composed of NaCl and buffer EE, and in then Because of the absence of Mbo fragments of suitable size 

immediately placed on ice. from the chromosome of Nm Z2491, the rotamase and opc 

10 ^1 of this dilution are added to 400 |li1 of PCR reaction genes cannot be included in the bank. Any enzyme cannot by 

mixture (Tris.HCl pH 9.0 10 mM; KCl 50 mM; MgCl2 1.5 itself produce a small fragment corresponding to any 

mM; Triton XI 00 0.1%; 0.25 mM of each of the four Nm-specific gene. A second bank was therefore constructed 

triphosphate deoxynucleotides; Taq polymerase 50 units per using another restriction enzyme with a different specificity: 

ml). The mixture is incubated for 3 min at 70° C. to complete Tsp509 [sic], 

the ends of the reannealed meningococcal DNA fragments. b. "Tsp509r' Bank 

After denaturing at 94° C. for 5 min and addition of the Construction — ^The enzyme Tsp5091 has the advantage of 

oligonucleotide RBam24 in an amount of 0.1 nmol per 100 producing fragments of smaller size (less than about 1 kb) 

}j,l, the hybridizations are amplified by PCR (30 cycles of 1 than the enzyme Mbol. 

min at 94° C, 1 min at 70° C. and 3 min at 72° C, followed Tsp509I recognizes the sequence AATT and leaves, pro- 

by 1 min at 94° C. and 10 min at 72° C; Perkin-Ehner jecting at 5', a sequence of 4 bases compatible with EcoRI. 

GeneAmp 9600). The oligonucleotides used are Reco, Jeco and NEco. 

The amplified meningococcal fragments are separated The method followed conforms with that followed for 

from the primers and high molecular weight gonococcal construction of the "Mbol" bank described above. However, 
DNAs on gel. They are digested by Mbol and the oligo- 55 higher quantities of meningococcal DNA were used for the 

nucleotides JBaml2 and JBam 24 are spliced to them again. first iteration of the subtractive hybridization in order to 

These spliced DNAs are again purified over gel and compensate for the liigher number of fragments of low 

extracted with phenol. molecular weight produced by Tsp509I. For the first 

A second iteration of the subtractive hybridization is iteration, 400 ng Nm DNA fragments and, in the second, 25 
carried out on 40 fig of the randomly sheared Ng DNA and 60 ng Nm fragments are subjected to subtractive hybridization 

25 ng of the DNA spliced with the JBam oligonucleotides with 40 |xg randomly sheared Ng DNA. 

obtained from the first iteration of the subtractive hybrid- For the construction of this "Tsp509r' bank, as a control, 

ization. During this second iteration, amplification of the a third iteration of the subtractive hybridization is carried out 

auto-annealed Nm DNA is effected with the aid of the using 40 }ig sheared Ng DNA and 0.2 ng Nm fragments 
oligonucleotide JBam24. 65 resulting from a digestion by Tsp509I and a resplicing, with 

Specificity — ^In order to confirm their Nm specificity, the NEco adaptors, of the fragments obtained as a result of the 

amplified sequences after the second iteration of the CDA second iteration. 
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Specificity — ^As described for the previous baiik, the The bank produced by Tsp509I thus constitutes a source 

product resuhing from the second iteration of the CD A of Nm-specific clones, 
method is labelled and used as the probe for a panel of 

strains of Neisseria. EXAMPLE 2 

FIG. 1 A illustrates the Southern blot hybridization of 5 . . . .i ^i ^ ^i o i . ^- i 

products of the seconditerationof the CDA method with the ^^^^^^^ ^^^^^ ^^^^^^ ^^^^ Subtractive Bank 

DNA digested by Clal of: Nm in track a, Ng MSI 1 in track Cloning and Sequencing of the Nm-Specific DNAs 

b, Nm 8013 in track c, Ng 403 in track d, Nm 1121 in track j^e DNAs of the subtractive banks are clones at the 

e, Ng 6934 m track f, Nm 1912 m track g, Ng WI (strain g^j^HI (Mbol bank) or EcoRI (Tsp509I bank) site of the 
DGI) m track h, Nm 7972 m track i, Nl 8064 in track j, Nc 10 pjasmid pBluescript, and then transformed in DH5a of E. 

32165 m track k, Nm 8216 m track 1. The inserts are ampHfied by PGR carried out on the 

In contrast to the high reactivity observed with all the Nm transformed colonies using the primers M13-50 and M13- 

strains, a low or no reactivity is observed with the Ng, Nl and 40, the latter primer being biotinylated on its 5' end. 

Nc strains. Sequencing was carried out on each PGR product after 

The specificity of the bank was studied eariier by reacting ^ separation of the biotinylated and non-biotinylated strands 

membrane transfers (Southern blots) of the products pro- using the system of Dynabeads M-280 with streptavidin 

duced by each of the three iterations of the GDA method (Dynal, Oslo). The sequences are screened according to their 

with probes corresponding to pilGl and ppk. These two homologies with previously published sequences using the 

genes are common to Nm and Ng. computer programs Blastn and Blastx (NGBI, USA and 

FIG. IB shows an agarose gel after electrophoresis of the Fasta). 

chromosomes of Nm Z2491 and Ng Msll, digested by The PGR products resulting from the transformed bacteria 

Tsp509 [sic], and products resuhing from each of the itera- colonies after using the primers M13-40 and M13-50 as 

tions of the GDA method. described above were labelled by incorporation with random 

In track a 1 |Lig of the chromosome of Nm was deposited, 25 priming of a-^^P-dGTP and were used as a probe for the 

in track b 1 jig of that of Ng, in track c 0.15 jig of the membrane transfers of the chromosomal DNA digested by 

products resulting from the first GDA iteration, in track d 0. 1 Glal of strains Nm Z249 1 and Ng MSI 1 , as described above, 

|xg of those of the second iteration, in track e 0.05 ^ig of the in order to verily their specificity. 

third iteration, MW representing the molecular size markers. Mapping of Glones on the Ghromosome of the Strain Nm 

FIGS. IG and ID show gels obtained as described in FIG. 30 Z2491 . 

IB after transfer to the membrane (Southern blots) and ^he resuhs of studies carried out with 17 clones of the 

hybridization with pilGl (FIG. IG) and ppk (FIG. ID). «j^qJ„ (designated by the letter B) and 16 clones of 

At the end of the second iteration of the GDA method, the the "Tsp5091" bank (designated by the letter E), each of 

sequences corresponding to the pilGl and ppk genes are these clones having a unique sequence and being without 
completely excluded from the bank. 35 counterpart in Ng, are reported. 

Exhaustivity— The exhaustivity of the bank was exam- jhe positions of the DNA sequences corresponding to 

ined by reacting the products resulting from the subtractive cloned Nm-specific products were determined with respect 

hybridization with the probes corresponding to three to the published map of the chromosome of Nm Z2491 

Nm-specific genes (frp, rotamase and opc). (Dempsey et al. 1995, J. Bacteriol. 177, 6390-6400) and 

These Nm-specific probes react with the amplification with the aid of transfers to membranes (Southern blots) of 

products resulting fi*om the first and second iteration of the agarose gel subjected to pulsed field electrophoresis 

subtractive hybridization. (PFGE). 

FIGS. IE, IF and IG show gels obtained as described in The Nm-specific clones are used as probes for a hybrid- 

FIG. IB after transfer to the membrane (Southern blots) and ization on membranes (Southern blots) of the DNA of Nm 

hybridization with firpA (FIG. IE), rotamase (FIG. IF) and Z2491 digested with enzymes of rare cutting sites, that is to 

opc (FIG. IG). say Pad, Pmel, Sgfl, Bglll, Spel Nhel and Sgfl. 

However, a third iteration of the subtractive hybridization The gels (20x20 cm) were gels of 1% agarose in a buffer 

leads to the loss of Nm-specific sequences, since the frag- TBE 0.5x and were subjected to electrophoresis at 6 V/cm 

ments which react with the rotamase and opc genes are for 36 hours according to pulsation periods varying linearly 
absent from this third iteration. ^ between 5 and 35 seconds. 

In consideration of all these data, it emerges that the The hybridizations on the membrane (Southern blots) 

products resulting from the second iteration of the GDA were carried out as described above, 

method are Nm-specific and also constitute an exhaustive jhe resuhs obtained are shown on FIG. 2: the reactivity 
bank of Nm-specific sequences. 55 ^^s located by comparison with the positions of the frag- 

The products resulting from this second iteration are nients of corresponding size on the published map. The 

cloned at the EcoRI site of the plasmid pBluescript. positions of all the genetic markers mapped by Dempsey et 

The bank produced by Tsp509I is more exhautive [sic] al (mentioned above) are visualized with the aid of points on 

than the bank produced by Mbol, as the theory consider- the to linear chromosomal map. The Nm-specific genes 
ations based on the enzymatic production of smaller restric- 60 disclosed previously are labelled with an asterisk. The two 

tion fi-agments would suggest. loci called "fi:p" correspond to the frpA and fii)c genes. The 

In accordance with this aspect, it should be noted that the "pilG" loci correspond to the pilGl and pilC2 genes, which 

Tsp509I bank is less redundant than the Mbol bank, that is are pairs of homologous genes and are not distinguished on 

to say it comprises less duplication of clones. 86% of the the map. The accuracy of the positions of the Nm-specific 
clones of the Tsp509I bank correspond to distinct sequences, 65 clones of the invention depends on the overlapping of 

while only 43% of the clones correspond to distinct reactive restriction fragments. On average, the position is 

sequences in the Mbol bank (data not shown). +/-20 kb. 
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This mapping reveals a non-random distribution of the 
Nm-specific sequences. The majority of the Nm-specific 
sequences belong to tliree distinct groups. One of these 
groups (region 1) corresponds to the position of genes 
relating to the capsule which have been described previ- 5 
ously. 

A distinction is made between: 

E109, E138, B230 and B323 as being region 1, 

B322, B220, B108, B132, B233, B328, E139, E145 as 
BlOl as being region 2, and 

B306, E114, E115, E124, E146, E120, E107, E137 and 
142 as being region 3. 
63% of the sequences identified as specific to meningococci 
are located inside these three distinct regions. 15 

This grouping contrasts with the distribution of previ- 
ously disclosed Nm-specific genes (frpA, fi:pC, porA, opc 
and the region relating to the capsule). 

This prior art would suggest in fact that the Nm-specific 
genes, with the exception of functional genes relating to the 20 
capsule, were dispersed along the cliromosome. 

Mapping of Nm-specific sequences on the chromosome 
leads to an unexpected result with regard to the prior art. 

The majority of the genetic differences between the 
meningococcal and gonococcal strains tested are grouped in 25 
three distinct regions. 

Meningococcal genes relating to the capsule are grouped 
in region 1. 

The function of genes of the other regions is unknown, but 
homologies with published sequences (table 1) suggest 30 
similarities between certain genes of region 3 and bacte- 
riophage transposase and regulatory proteins. No meningo- 
coccal virus has been characterized and it is tempting to 
think that these sequences are of phagic origin. Interestingly, 
the genome of H. influenzae also contains a sequence 35 
homologous to that of the Ner regulatory protein of phage 
Mu, but it is not known if it is a functional gene. 

The clone B208 has a highhomology (48% identical, 91% 
homology for 33 amino acids) with a clone of conserved 
regions field III) in the class of proteins which bind to 40 
TonB -dependent ferric siderophors. 

The proximity of this clone with the Nm-specific porA 
genes and the frp genes regulated by iron, and in particular 
the possibility that it is an Nm-specific receptor protein 
exposed on the external membrane in itself is a good 45 
candidate for further research. 

The clone B339 corresponds to the Nm-specific insertion 
sequence IS 11 06. 

The low homology between the clone B134 and the 
Aeromonas insertion sequence and also the presence of 
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multiple copies of the clone B134 among the various strains 
of Nm suggest that it could be a new type of Nm-specific 
insertion sequence. 

The possibility that the regions containing the 
Nm-specific clones could correspond to pathogenicity islets 
as described previously for E. coli and Y. pestis is of 
particular interest. 

The clones isolated in this invention will allow better 
understanding of the relevance of Nm-specific regions in 
allowing cloning and sequencing of larger cliromosomal 
fragments, and directly by their use for loci mutations. 

Finally, detection of meningococcus -specific genes pos- 
sibly involved in the pathogenicity of the organism allows 
targeting of suitable antigens which can be used in an 
antimeningococcal vaccine. 

The effectiveness and the speed of the method according 
to the inventions enables it to be used in a large number of 
situations for which the genetic differences responsible for a 
phenotype peculiar to one of 2 close pathogens are investi- 
gated. 

Study of the Reactivity of the Clones of Regions 1, 2 and 3 
Towards a Panel of Strains of Neisseria. 

The PGR products corresponding to inserts of each of the 
clones were collected and used as probes for hybridization 
on membranes (Southern blots) for a panel of strains of Nm, 
Ng, Nl and Nc. 

Regions 1 and 2 produce a limited number of bands for 
each of the meningococci. This suggests that these regions 
are both Nm-specific and common to all the meningococci. 

FIG. 3 illustrates the reactivity of the clones of regions 1, 
2 and 3 towards a panel of neisserial strains. The clones of 
regions 1 (FIG. 3A), 2 (FIG. 3B) and 3 (FIG. 3C) taken 
together were used as probes towards a panel of 
meningococci, gonococci and towards a strain of Nl and Nc. 

The tracks are as follows: DNA of: Nm Z2491 in track a, 
of Ng MSll in track b, of Nm 8013 in track c, of Ng 403 
in track d, of Nm 1121 in track e, of Ng 6934 in track f, of 
Nm 1 912 in track g, of Ng WI (strain DGI) in track h, of Nm 
7972 in track i, of Nl 8064 in track j, of Nc 32165 in track 
k, and of Nm 8216 in track 1. 

Remarkably, region 3 has reactivity only with the men- 
ingococci of serogroup A. This region 3 is therefore specific 
to a sub-group of Nm. 

A comparison was made with the known sequences in the 
databanks in order to evaluate the possible functions of the 
cloned regions. 

Table 1 which follows gives the positions of specific 
clones on the chromosomal map and the homologies with 
known sequences. 



TABLE 1 



Position of specific clones on the cluomosomal map and homologies with known 
sequences 

Size Reactive Position 

Name of of fragments on Homologies of protein 

clone* insert Pac Pmc Bgl Spe Nhe Sgf Z2491 sequences 

B305 259 18-20 15-17 22-23 18 11- 2 X736 

13 

B333 235 15-17 22-23 18 11- 2 X736 

13 

E109^^ 211 6-7 11-15 10 11- 2 tufA protein LipB 

13 ctrA N. meningitidis 

(3 X 10-26) 
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TABLE 1 -continued 

Position of specific clones on the chromosomal map and homologies with known 
sequences 

Size Reactive Position 



Name of 


of 




fragments 








on 


Homologies of protein 


clone* 


insert 


Pac 


Pmc 


Bgl 


Spe 


Nhe 


Sgf 


Z2491 


sequences 


E138^^ 


315 


1 


6—7 


11—15 


10 


11- 


2 


tufA 


protein LipB 














13 




ctrA 


N. meningitidis 




















(4 X 10-^5) 


B230^ 


356 


1-3 


6—7 


1 


10 


11- 


2 


ctrA 


protein KpsC E. coli 














13 






(3 X 10-^3) 


B323^ 


363 


1 


6—7 


1 


10 


11- 


2 


CtrA 


protein CtrB 














13 






A^. meningitidis (2 x 10^) 


B3222 


210 




2 


16—1 8 


6 


1 


5 


pilQ/X 


HlyB S. marcescens 


















740 


(4 X 10-^5) 


B2202 


341 




2 


16-18 


6 


^18 


5 


pilQ/X 




















740 




B108 


275 




2 


19-21 


6 


^18 


5 


pilQ/X 




















740 




B132 


411 


2 


2 


19-21 


6 


> 1 Q 

^ is 


5 


pilQ/X 




















740 




B233'^ 


164 


1-3 


2 


19-21 


5 


^18 


5 


pilQ/X 




















740 




B328^ 


256 


1-3 


2 


22-23 


6 


^18 


5 


pilQ/X 




















740 




E139^ 


324 


2 


2 


19-21 


6 


^18 


5 


pilQ/X 




















740 




EI452 


343 


2 


2 


19-21 


6 


^18 


5 


pilQ/X 




















740 




BlOl^ 


254 


^20 


2 


19—2 1 


6 


^18 


5 


pilQ/X 




















740 




E103q 


334 




2 


11-15 


3-5 


10 


3 


;^644 




B326§ 


314 




2 


11-15 


3^ 


10 


3 


K644 




B326 (low 






5 


6 


16 


2 


1 


argF 




reactivity) 




















B342 


167 




2 


19 


3^ 


6-7 


3 


iga 




E136 


249 




2 


7 


1 


3 


3 


lepA 




B208 


177 




1 


2 


3^ 


2 


1 


porA 


Felll pyochelin receptor 




















P. aeruginosa (5. 10"^) 


= B306^ 


219 


11 


5 


11-12 


5 


2 


4 


parC 




EII43 


227 


11 


5 


11-12 


5 


2 


4 


parC 




E115^^ 


251 




5 


11-15 


5 


2 


4 


parC 




EI243 


208 




5 


11-12 


5 


2 


4 


parC 




E146^ 


146 




5 


11-15 


5 




4 


parC 




E120^ 


263 




5 


3^ 


5 


16 


4 


opaB 




E107^ 


248 


11 


14—17 


3^ 


5 


16 


4 


opaB 




E137^ 


274 




14-17 


3^ 


5 


16 


4 


OpaB 


Transposase 




















Bacteriophage D3112 




















(6 X 10-^^) 


E142^ 


230 




14-17 


3^ 


5 


16 


4 


OpaB 


Protein Ner-Likc 




















H. influenzae (6 x 10"^^) 




















Protein binding to tlie DNA 




















Ner, phage mu (3 x 10"^^) 


E116 


379 


5-7 


11-13 


3^ 


2 


6-7 


8 






B313 


436 


9 


9 


3^ 


13- 


5 


2 


X611 














14 










B341 


201 


8-10 


9 


3^ 


13- 


5 


2 


X611 














14 










E102 


238 




11-13 


3^ 


19 


5 


2 


>.601 


Hypothetical protein 




















H11730/f. influenzae 




















(7 X 10-24) 


B134 


428 






multiple 










transposase ISAS2 




















Aeromonas 




















salmonicida (5 x 10"^) 


B339 


259 






multiple 










transposase IS 1106 




















N. meningitidis (6 x lO"'^^) 



The level of homologies found, as given by the Blastx program, are indicated in parentheses 

*The clones labelled with the index "1", "2" or "3" belong to regions "1", "2" or "3" respectively of the 

chromosome of N. meningitidis Z2491. 

^"^E109 and E138 are contiguous clones §B306 and El 15 overlap *'B236 also has a low reactivity in the 
region of arg F 

q) Clone El 03 contains a Tsp509 I site and can therefore contain two inserts; however, since it reacts only 
with a single fragment Clal (Oks) of the chromosome of N. meningitidis Z2491 and occupies only one posi- 
tion on the map, this clone is included here. 
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Firstly, it can be seen that the clones of region 1 all 
correspond to genes involved in biosynthesis of the capsule. 
These genes have previously been studied among the Nm of 
serogroup B (Frosch et al. 1989, Proc. Natl. Acad. Sci. USA 
86, 1669-1673 and Frosch and Muller 1993, Mol. Micro- 5 
biol. 8 483-493). 

Wiih the exception of a low homology with the haemol- 
ysin activator of Serratia marcescens, the clones of region 2 
have no significant homology with published sequences, 
either in the DNA or the proteins. 10 

Two of the clones of region 3 have interesting homologies 
with proteins which bind to the DNA, in particular the 
bacteriophage regulatory proteins and transposase proteins. 

Clone B208 has a high homology with one of the con- 
served regions in one class of receptors (TonB-dependent 15 
ferric siderophor). 

Clones B134 and B339 hybridize with several regions of 
the chromosome (at least 5 and at least 8 respectively). 

Data relating to the sequences show that clone B339 
corresponds to the Nm-specific insertion sequence SI 106. 20 

The translation of the clone B143 has a limited homology 
with the transposase of an Aeromonas insertion sequence 
(SAS2) (Gustafson et al. 1994, J. Mol. Biol. 237, 452^63). 
We were able to demonstrate by transfer on a membrane 
(Southern blots) that this clone is an Nm-specific entity 25 
present in muhiple copies in the chromosomes of every 
meningococcus of the panel tested. 

The other clones have no significant homology with the 
published neis serial sequences, and furthermore nor with 
any published sequence. These clones therefore constitute, 30 
with the majority of the other clones isolated, a bank of 
totally new Nm-specific loci. 

EXAMPLE 3 

Study of Region 2 of the Nm Chromosome 35 
Determination and Characterization of the Sequence of 

Region 2 

PCR amplification is carried out with the chromosomal 
DNA of strain Z2491 of serogroup A, sub-group IV- 1 using 
oligonucleotide primers formulated from each of the 40 
sequences of clones of region 2 in several different combi- 
nations. The PCR products which overlap are sequenced 
from the 2 strands using the chain termination technique and 
automatic sequencing (ABI 373 or 377). 

To prolong the sequence beyond the limits of the clones 45 
available, partial SauIIIA fragments of 15 kb of the strain 
Z2491 are cloned in Lambda DASH-II (Stratagene). 

The phages containing the inserts overlapping region 2 
are identified by hybridization with clones of this region as 
probes. The DNA inserted is sequenced from the ends of the 50 
inserts, and these sequences are used to formulate new 
primers which will serve to amplify the chromosomal DNA 
directly, and not the phagic DNA. 

An amplification of the chromosomal DNA is obtained 
using these new primers and those of the sequence previ- 55 
ously available. 

These PCR products are also sequenced from the 2 
strands, which leads to a complete sequence of 15,620 bp 
(SEQ ID No. 36). The reading frames of this sequence which 
start with ATG or GTG and are characterized by a high 60 
codon usage index are analysed. 

This analysis reveals 7 ORFs of this type which fill the 
major part of the sequence of 15,620 bp. The positions of 
these ORFs are the following: 

ORF-1: 1330 to 2970 (SEQ ID No. 37); ORF-2: 3083 to 65 
9025 (SEQ ID No. 38); ORF-3: 9044 to 9472 (SEQ ID No. 
39); ORF-4: 9620 to 12118 (SEQ ID No. 40); ORF-5: 12118 
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to 12603 (SEQ ID No. 42); ORF-6: 12794 to 13063 (SEQ 
ID No. 43); ORF-7: 13297 to 14235 (SEQ ID No. 44); and 
ORF-8: 14241 to 15173 (SEQ ID No.45). 

ORF-4 starts with the codon GTG and overlaps a slightly 
smaller ORF (SEQ ID No. 41) in the same reading frame 
(10127-12118, frame 2), which starts with the codon ATG. 

ORF-4 codes for a protein which has structural homolo- 
gies with a family of polypeptides comprising pyocins 
{Pseudojnonas aeruginosa), collcins and intimins 
{Escherichia coli), which are bactericidal toxins (pyocins, 
collcins) or surface proteins involved in adhesion of bacteria 
to eukaryotic proteins. ORF-7 encodes a protein, the 
sequence of which contains a potentially transmembrane 
region and which has structural homologies with periplas- 
mic proteins or proteins inserted in the external membrane 
of bacteria. The structural homologies of ORF-4 and ORF-7 
have been identified with the aid of the PropSearch program. 

Investigation of sequences homologous to other ORFs in 
GenBank with the aid of the BLAST program revealed a 
homology between the N-temiinal regions of ORF-2 and 
filamentous haemagglutinin B of Bordetella pertussis (43% 
similarity, 36% identical over 352 amino acids) and between 
ORF-1 and the protein fhaC of Bordetella pertussis (35% 
similarity, 27% identical over 401 amino acids). ORF-1 and 
ORF-2 are neighbouring genes in the strain Z249I and 
filamentous haemagglutinin B of Bordetella pertussis and 
fhaC are neighbouring genes in Bordetella pertussis, which 
reinforces the probability that these homologies reflect func- 
tional homologies. 

Confirmation of the specificity of region 2 with respect to 
Nm 

Southern blots are carried out using the DNA probes 
obtained by PCR amplification of various parts of region 2 
using oligonucleotide primers formulated firom sequences of 
clones of region 2. 

The approximate position of these oligonucleotides is 
shown on FIG. 4. 

These are the oligonucleotides called R2001 (SEQ ID No. 
46) and R2002 (SEQ ID No. 47) in one half of ORF-1, the 
oligonucleofides b332a (SEQ ID No. 48), el39a (SEQ ID 
No. 49), bl32a (SEQ ID No. 50) andb233b (SEQ ID No. 51) 
in one half of ORF-l+the majority of ORF-2, and the 
oligonucleotides el45a (SEQ ID No. 52) and blOla (SEQ 
ID No. 53) in 'A of ORF-4+ORF-5 to 7. 

The three Southerns are carried out under the following 
hybridization conditions: 
16 hat 65° C, 
NaP04 0.5 M, pH 7.2 
EDTA-Na 0.001 M 
1% sodium dodecylsulphate. 

For the washing, heating is carried out for 10 min at 65° 
C, and NaP04 0.5 M, pH 7.2; EDTA-Na 0.001 M, 1% 
sodium dodecylsulphate are used. 

FIGS. 5, 6 and 7 respectively show the Southern blots 
obtained with each of the abovementioned ORF parts. 

The 14 tracks correspond respectively, in each of the 
Southerns, to 
1: MSll (Ng) 
2: 403 (Ng) 
3: FA1090 (Ng) 
4: Wl (Ng) 
5: 6493 (Ng) 

6: marker (lambda hindlll) 

7: Z2491 (Nm, gpA) 

8: 7972 (Nm gpA) 

9: 8013 (Nm, gpC) 

10: 1121 (Nm, grouping not possible) 
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11: 1912 (Nm, gpB) 
13: 32165 (Nc) 
14: 8064 (Nl). 

Given that a panel of strains of Neisseria is used in these 
experiments and that each well is charged with a similar 5 
amount of digested DNA, these 3 Southern blots clearly 
show that the sequences corresponding to region 2 are found 
in all the meningococci tested and that significant homolo- 
gous sequences do not exist in the genome of the Ng of the 
strains tested. 10 

EXAMPLE 4 

Identification of Regions of the Nm Genome 

Absent from Nl and Coromon with Ng 

The tecluiique described in example 1 is followed, but the 
chromosomal DNA of one strain of Nm (Z2491) and 2 
strains of Nl (XN collections), equal parts of the DNAs of 
which are mixed, is used. 

2 subtractions are performed using the R and J series of 
primers. Three different banks are thus obtained. 

Two banks, called Bam and Eco, are obtained respectively 
by digestion of the chromosomal DNA of Nm Z2491 by 
Mbol and Tsp5091; a third bank, called Cla, which results 
from digestion of the chromosomal DNA of Nm by Mspl, is 25 
obtained using the primer set RMsplO, RMsp24, JMsplO 
and JMsp24. All the primers used are shown in the following 
table 2. 



TABLE 2 



AdaDters for differential 1 


banks 


Chromosomal DNA 


Cloning in 


digested by 


pBluescript by 


Mbol ^ 


BamHI 


Tsp509I ^ 


EcoRI 


Mspl -» 


Clal 



30 



35 



40 

First subtraction cycle 

(SEQ ID No. 54) 

RBaml2: 3' AGTGGCTCCTAG 5' 

(SEQ ID No. 55) 
RBam24: 5' AGCACTCTCCAGCCTCTCACCGAG 3' 45 

(SEQ ID No. 56) 

REc o 1 2 : AGTGGCTCTTAA 

(SEQ ID No. 55) 
RBam2 4: 5' AGCACTCTCCAGCCTCTCACCGAG 3' 
(REco 24 = RBam 24) 

(SEQ ID No. 57) 50 

RMspl 0 : AGTGGCTGGC 

(SEQ ID No. 58) 
RMsp24: 5' AGCACTCTCCAGCCTCTCACCGAC 3' 
Second subtraction cycle 

(SEQ ID No. 59) 

Jbaml2: 3' GTACTTGCCTAG 5' 

(SEQ ID No. 60) 
JBam2 4 : 5 ' ACCGACGTCGACTATCCATGAACG 3 ' 

(SEQ ID No. 61) 

JEcol2: GTACTTGCTTAA 

(SEQ ID No 60) 
JBam2 4: 5' ACCGACGTCGACTATCCATGAACG 3' 
(JEco 24 = TBam 24) 

(SEQ ID No. 62) 

JMspl 0 : GTACTTGGGC 

(SEQ ID No. 63) 
JMsp2 4: 5' ACCGACGTCGACTATCCATGAACG 3' 



60 



65 



After 2 subtractions, the entire product of each amplifi- 
cation is labelled and used as a probe. 



The subtractive banks are checked by Southern blotting 
over a panel of 12 strains of Neisseria (chromosomal DNA 
cut by Clal). The hybridization conditions are identical to 
those given in example 1. 

These Southern blots are shown on FIGS. 8 A to 8C, which 
relate respectively to the MboI/BamHI bank, to the Mspl/ 
Clal bank and to the Tsp5091/EcoRI bank. 

The 12 tracks correspond respectively, to 
1: NmZ2491 (group A) 
2: Nl 8064 

3: Nm 8216 (group B) 
4: Nl 9764 

5: Nm 8013 (group C) 
6: Ng MSll 
7: Nm 1912 (group A) 
8: Ng 4C1 

9: Nm 1121 (grouping not possible) 

10: Ng FA1090 

11: Nc 32165 

12: Nm 7972 (group A) 

Examination of the Southern blots shows that the 
sequences contained in each bank are specific to Nm and are 
not found in Nl. Furthermore, the reactivity found with the 
strains of Ng suggests that some of these sequences are 
present in Ng. 

Each of these banks was then cloned in pBluescript at the 
BamHI site for Bam, or the EcoRI sit for Eco, or the Clal site 
for Cla. In order to confirm the specificity of the clones with 
respect to the Nm genome, restriction of the individual 
clones and radiolabelling thereof were carried out. The 
clones showing reactivity for both Nm and Ng were kept for 
subsequent studies. These clones are shown on FIGS. 9, 10 
and 11, which give the profiles with respect to Nm, Nl and 
Ng of 5 clones of the Bam bank (FIG. 9), 16 clones of the 
Eco bank (FIG. 10) and 13 clones of the Cla bank (FIG. 11). 

These clones were sequenced using universal and reverse 
primers. They are 
Bam clones 

partial Bll of 140 bp (SEQ ID No. 66), partial B13 
estimated at 425 bp (SEQ ID No. 67), B26 of 181 bp (SEQ 
ID No. 68), B33 of 307 bp (SEQ ID No. 69), B40 of 243 bp 
(SEQ ID No. 70), 
Cla clones 

CI 6 of 280 bp (SEQ ID No. 72), partial C20 estimated at 365 
bp (SEQ ID No. 73), partial C24 estimated at 645 bp (SEQ 
ID No. 74), partial C29 estimated at 245 bp (SEQ ID No. 
75), C34 of 381 bp (SEQ ID No. 76), C40 of 269 bp (SEQ 
ID No. 77), C42 of 203 bp (SEQ ID No. 78), p C43 of 229 
bp (SEQ ID No. 79), C45 of 206 bp (SEQ ID No. 80), C47 
of 224 bp (SEQ ID No. 81), C62 of 212 bp (SEQ ID No. 82), 
and C130 (5* . . . ) estimated at 900 bp (SEQ ID No. 83), and 
Eco clones 

E2 of 308 bp (SEQ ID No. 84), partial E5 estimated at 170 
bp (SEQ ID No. 85), partial E22 estimated at 300 bp (SEQ 
ID No. 86), E23 of 273 bp (SEQ ID No. 87), E24 of 271 bp 
(SEQ ID No. 88), E29 of 268 bp (SEQ ID No. 89), partial 
E33 estimated at 275 bp (SEQ ID No. 90), partial E34 
estimated at 365 bp (SEQ ID No. 91), E45 of 260 bp (SEQ 
ID No. 92), E59 estimated at greater than 380 bp (SEQ ID 
No. 93), E78 of 308 bp (SEQ ID No. 94), E85 of 286 bp 
(SEQ ID No. 95), E87 of 238 bp (SEQ ID No. 96), partial 
E94 greater than 320 bp (SEQ ID No. 97), partial E103 
greater than 320 bp (SEQ ID No. 98) and EllO of 217 bp 
(SEQ ID No. 99). 

Mapping of each clone was carried out on the chromo- 
some of Nm Z2491 as described in example 1. The results 
obtained are given on the right-hand part of FIG. 2. It is 
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found that these clones correspond to regions called 4 and 5. 
These regions are therefore made up of sequences present 
both in Nm and in Ng, but not found in Nl. It is therefore 
regarded that these are sequences which code for virulence 
factors responsible for the initial colonization and penetra- 
tion of the mucosa. Region 4 is located between argF and 
regF on the chromosome of Nm 2491 [sic], and region 5 is 
located between the lambda 375 marker and penA. This 
region probably contains sequences which code for an Opa 
variant and a protein which binds transferrin. 

A comparison with the known sequences in the databanks 
has half [sic] that in region 4 only the clone CI 30 has a 
homology, that is to say with Mspl methylase. In region 5, 
no homology with known sequences was found with the 
clones C8, E2, B40, C45, E23 and E103. For the other 
clones, the homologies are the following: 

Bll aiginine decarboxylase SpeA; C29 arginine decar- 
boxylase SpeA; C62 oxoglutarate/malate transporter; repeti- 
tive DNA element; E34 repetitive DNA element; E94 
murine endopeptidase MepA; C47 citrate synthase PrpC; 
E78 citrate synthase PrpC 

EXAMPLE 5 

Demonstration of the Presence of One or More 
Strains of Neisseria meningitidis in a Biological 
Sample 

A biological sample of the cephalorachidian fluid, urine, 
blood or saliva type is taken. 
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24 

After filtration and extraction, the DNAs present in this 
sample are subjected to gel electrophoresis and transferred 
to a membrane by Southern blotting. 

^ A nucleotide probe constructed by labelling SEQ ID No. 
5 with ^^P is incubated with this transfer membrane. 

After autoradiography, the presence of reactive band(s) 
allows diagnosis of the presence of Neisseria meningitidis in 
the sample. 

10 

EXAMPLE 6 

Vaccine Composition Including in its Spectrum 
15 Antimeningocococcal Prophylaxis and Intended for 
Prevention of any form of Infection by Neisseria 

meningitidis 

The peptide coded by a sequence including SEQ ID No. 
20 10 is conjugated with a toxin. 

This conjugated peptide is then added to a composition 
comprising the anti-Haemophilus and antipneumococcal 

vaccine, or any other childhood vaccine. 

25 After having been sterilized, the resulting composition 
can be injected parenterally, subcutaneously or intramuscu- 
larly. 

This same composition can also be sprayed on to mucosa 
with the aid of a spray. 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(iii) NUMBER OF SEQUENCES: 99 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GATCCGCTGC CGGCAGACGA ATATCAAGAC ATCTTCGATT TTATGAAACA GTATGACTTG 60 

TCTTACCCGT ATGAATATCT GCAGGATTGG ATAGATTACT ATACGTTCAA AACCGATAAG 120 

CTGGTATTTG GTAACGCGAA GCGAGAGTGA GCCGTAAAAC TCTGAGCTCC TGTTTTATAG 180 

ATTACAACTT TAGGCCGTCT TAAAGCTGAA AGATTTTCGA AAGCTATAAA TTGAAGCCCT 240 

TCCACAGTAC ATAGATC 257 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GATCATGTTC AAATAGATAG GCATGGGAAG CTGCAGCTCT AACGTCCATG AAAATATGTT 60 

GCATAGCTGC AAGCGGAACG CCTTTTCTTT CATCTACATA ATCTATAGAG TCAAGGCAAC 120 

CGCTATTGAA ATTAGCAGTA TTGCCTATGA TTACATTAGT AATATGCTCA TACCATTTTT 180 

GGGTGGTCAT CATATTGTGC CCCATTGTTA TCTCCTTATA TTGGTTTTAG AAGGAACTTT 240 

GACAGGAAGA ATAACGGCCT TACCTGTTTG ACGATC 276 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GATCTGGTGG TGTTTGCACA GGTAGGCGCA TACTTGTTCG GGACTGAGTT TGCGGCGGAT 60 

AAGGGTGTCG ATGTGCTGAA TCAGCTGCGA ATCGAGCTTA TAGGGTTGTC GCTTACGCTG 120 

TTTGATAGTC CGGCTTTGCC GCTGGGCTTT TTCGGCGCTG TATTGCTGCC CTTGGGTGCG 180 

GTGCCGTCTG ATTTCGCGGC TGATGGTGCT TTTGTGGCGG TTAAGCTGTT TGGCGATTTC 240 

GGTGACGGTG CAGTGGCGGG ACAGGTATTG GATGTGGTAT CGTTCGCCTT GGGTCAGTTG 300 

CGTGTAGCTC ATGGCAATCT TTCTTGCAGG AAAGGCCGTA TGCTACCGCA TACTGGCCTT 36 0 

TTTCTGTTAG GGAAAGTTGC ACTTCAAATG CGAATCCGCC GACCTCTTTC AGTTACAGCA 42 0 

GCTTGATC 428 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GATCCTGCAT TGACATCGGC CTTGGCTGTC AGGGTATTGT GACCGGTAAA GTCGGCATTA 60 

CCGTTGGCCA ATAAGGATAC ATGACCGTCT GCAGAAACAG CATGAAGGCC GTCTGAAACG 120 

ATATTGCCCT GCAATGCGGT GGTTTCGAGA GCCTTGGCTG CGTTCAGCTT GGTATTGCGA 180 

AGCTGAATAT TGCCTTTGGC TGCCTGAATG TGCAGATTAC CCGAGTTGGT ACGCAGATTG 24 0 

GTATTGGTAA CATTCAGCAA GCCTGCCTCC ACACCCATGT CTTTTGAGGC AGTGAGGGTT 30 0 
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TTACTGGTGC CGGTAATATG GGCAGCGTTA TCCGATTTCA AATGGATGCT GGCCGGCAGA 360 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GATCAGATTG GTGAAGACGG TATTACCGTC AATGTTGCAG GCCGTTCGGG ATATACGGCG 6 0 

AAAATCGACG TGTCTCCGAG TACCGATTTG GCGGTTTATG GCCATATTGA AGTTGTACGG 12 0 

GGTGCAACGG GGTTGACCCA ATCCAATTCA GAGCCGGGTG GAACCGTCAA TTTGATC 17 7 



(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GATCAATGAT GCTACTATTC AAGCGGGCAG TTCCGTGTAC AGCTCCACCA AAGGCGATAC 60 

TGAATTGGGT GAAAATACCC GTATTATTGC TGAAAACGTA ACCGTATTAT CTAACGGTAG 12 0 

TATTGGCAGT GCTGCTGTAA TTGAGGCTAA AGACACTGCA CACATTGAAT CGGGCAAACC 18 0 

GCTTTCTTTA GAAACCTCGA CCGTTGCCTC CAACATCCGT TTGAACAACG GTAACATTAA 24 0 

AGGCGGAAAG CAGCTTGCTT TACTGGCAGA CGATAACATT ACTGCCAAAA CTACCAATCT 300 

ACTCCC GGCAATCTGT ATGTTCATAC AGGTAAAGAT C 341 



(2) INFORMATION FOR SEQ ID NO : 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GATCCAACTG TTTGATTTTA CTGGCTGCTT CTCCATGCGC GGTATTGACC AAAGCCGCAA 6 0 

GGATATTCGC TTCCAGATTG TCTTTCAGGC TGCCGCCGTT GACAGCGGTA TTAATCAGTG 12 0 



CAAATCTTTA TCAACATTCA AATTCAGATC 



390 



CGGCACTGCC CGCATTGGCT AGGTTGACGG TCAGGTTGTT GATC 



164 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GATCAATCAC ACATCTTGTC ATTTTTTCGA TTCCTTCATT TCGGTTTCTA ATGTTTCAAT 60 

TCTTGCGGCC ATTTCCTGAA TGGCTTTAGT CAAAACGGGG ATGAACGCTT CGTATTCGAC 12 0 

GGTGTAGGTA TCGTTTGTTT TATTTACCAT CGGCAATCGA CCATATTCAT CTTCCAGCGC 180 

AGCAATGTCC TGGGCAATAA ACCAATGCCG CAACCGATC 219 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GATCTTGGGT AAGCCCCCAA CCTGCATAGA AAGGCAGGCC GTAGCAGCTG ACTTTTTTGC 60 

CGCGCAACAA GGCTTCAAAA CCGGTCAGCG AAGTCATGGT ATGTATTTCG TCTGCGTATT 120 

GGAGACAGGT CAGGATGTCG GCTTGTTCGG CGGTTTGGTC GGCATATCGT GCAGCATCAT 18 0 

CAGGGGAAAT ATGGCCGATG CGGTTACCGC TGACTACATC GGGATGCGGT TTGTAGATGA 24 0 

TATAGGCATT GGGGTTTCGT TCGCGTACGG TACGGAGCAA ATCCAGATTG CGGTAGATTT 30 0 

GGGGCGAACC GTAGCGGATA GACGCATCAT CTTCAACCTG GCCGGGAACG AGGATC 356 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GATCCGCTTT CAGTTTCCGT ACCGGTGGCA TCAGTCAAGT CCGTTTTGTG CACCAAACCG 60 

CGTCCATATG AAACATAAAA CAAATCGCTT AAGCCCAAAG GGTTATCGAA CGATAAAGCG 12 0 

ACATTTCCTT GATATTTGCC GGTCGTTTTG CCGCCCGCAT CATCTATACC GATACTGAAC 18 0 



CGTATGGGTT TATTCTGCTG CCATTTGATC 



210 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GATCCCGAAA CGCAATTGGT CGAAAGCTAT ATGCTGAACG ATGTGTTGCG GTTTTGGGAC 60 

AGCGCAGGTT TGGGCGATGG GAAAGAAGCC GACCGCGCCC ATCGGCAAAA ACTGATTGAT 12 0 

GTCCTGTCTA AAACCTATAC TCATTCGGAT GGGCAGTGGG GCTGGATAGA TTTGGTGTTC 18 0 

GTTATCCTTG ACGGCAGCTC CCGCGATTTG GGTACGGCCT ATGATTTGTT GAGGGATGTT 24 0 

ATCCTTAAAA TGATTGATC 259 

(2) INFORMATION FOR SEQ ID NO : 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 36 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GATCAAATGG ATGATTTATA TAGAATTTTC TTTTACGACT GCGTGCCGTT TGAAAAGAAA 60 

ATGCACAATC CCGTATCTCA TCGTGCCATA GATTTTTCAA AGACTCCGGA AGCCATATTT 12 0 

CGTTGCAATC TGCATACCGA ATTGAAGAAG AAGCGTAAAT TAGCGTTACG TTTAGGCAAG 18 0 

CTGTCGGACA ATACAGCATG GATATTAAAA CCCCAAGTCA TGAAAAATCT TCTGAAAAAC 24 0 

CCGTCAACTC AAATTACGGA AAACGATGTC GTGCTCGATG TTAAACAAAA AGGTGTAGAT 300 

ATGCGTATAG GCTTGGATAT TTCATCTATT ACCTTAAAAA AACAAGCCGA TAAAATCATC 360 

TTGTTTTCTG GTGATTCCGA TTTTGTCCCA GCAGCCAAAT TAGCCAGACG GGAAGGTATC 420 

GATTTTATTC TTGATC 436 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



GATCGTTTTA CGTCGCAATC GAGCTTTGTG GTGCGCTCGC CTAAAAGCCA ATCTTCTCTC 



60 
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AATGGCCTGG GTGCCATTTT GCAGGGCACA GGTTTTGCCC GTGCGCAAGA CGATATTTAT 120 

ACCGTGCAGG AATATATGCA GTCGCGTTCG GCTTTGGATG CGTTGCGTAA GAAAATGCCC 180 

ATTCGCGATT TTTATGAAAA AGAAGGCGAT ATTTTCAGCC GTTTTAATGG TTTTGGCCTG 240 

CGTGGCGAGG ATGAGGCGTT TTATCAATAC TACCGTGATA AGGTATCCAT CCATTTTGAC 300 

TCTGTCTCAG GCATTTCCAA TTTGAGCGTT ACATCGTTTA ATGCCGGTGA ATCTCAAAAG 360 

ATC 363 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GATCTTGCGT CATTTATATC TTCACCGATA TTGCAATTAC CGCCGTTCCA GTTGAAATAA 60 

CAACGACTAA AATTGTAGTT CCTAAAAGAA TCATTCCTAT TCTTGCGTAC CATTTCCCAA 120 

TAATTGCGCC CGACAATTTC CATTTAATGC TCCATCAGTT CTTTTACTTC CGGAAATCTG 180 

CTGTAATCTG ACATAAGACG CATAATTGAA CTATCAACGC CGTAACAGCC ATAGGTTTTA 240 

ATACCGTTTT CGGCGTGTTC CCAAATGCAA TTACTGTATT CGTAGCCTTT TACAAATTTA 300 

TCGGTTTCGG GATC 314 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GATCATACGA ATCTACCCTA AAATACCCCG TCGCCGATTT AGGATTGGCT ACATAAAGCT 60 

CATTATAAGG GTATTTTGAT GACATGATAC GGTTAAATTC ATTGCCGTTG TTTATCCTGA 120 

TTCTATAAAT TGGTTCAACA GCAAAGCCTC TGGATTCCCT TAATTGATTA TAATATTGCC 180 

TGTATGTTTG TACATCATGT CTTGTCCACG GCTCTCCAGG AGTCCTCAGA ATAGCAATCC 240 

CGTTAAATTT CGGATC 256 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 35 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GATCCACGCC TGTGCCTACC TTGGCTTTTT GTTCGCCAAA CAAGGCATTT AAGGTTGAGG 60 

ACTTGCCGAC ACCTGTCGCA CCGACAAGCA AGACATCCAA ATGACGGAAA CCGGCTGCTG 120 

TGACTTTTTG CCCGATTTCA GAAATACGGT AACGATGCAT ATGCGCTCCT ACCAGCCAAA 180 

AAAAGAAGCA ACCGTGCTAA TCGCCCCTCC AATCGCTTTT GCAGCACCGC CGATC 235 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GATCCAACGG GCATCGCTGT CCTTACTCGG TGTGGTTTGA CCGCTGATTT GTCCTTCTTC 60 

GTCAACTTCT ATGGCCTGAC GCTGTTTGCT GCCGGCGGTC TGGATAATGG TGGCATCAAC 120 

GACGGCGGCG GATGCTTTCT CTATTTTTAG GCCTTTTTCG GTCAGTTGGC AGTTAATCAG 180 

TTTGAGTAAT TCGGACAGGG TGTCGTCTTG CGCCAGCCAG TTGCGGTAGC GGCATAAGGT 240 

ACTGTAATCG GGGATGATC 259 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GATCTGTGCC GTTGATTTTA TCTTTCAGAT GCAGCATCGA ATATCGGAAA GCCAAATCAG 60 

CAATTCTTTT TGCATCGTGT GGATTTTGAG ACGGGCCTAA TGACCGTACC CGCTTAATAA 120 

AAAATGCACC GTCAATCAAA ATGGCGGTTT TCATATTGCT TCCCCTATAT TTGTCAAAGA 180 

TATAAAAAAG CCCTTGGGAT C 201 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



AATTCAAAGG AGGCATTTGT TGCAAGAAAA GTACAAAGTG ATTTGCAAAA AGCATTGAAT 



60 



GCTAGCAACT ATAACAAGCA GCAATATGCA AGACGTGCGG CAACAGCGTT AGAGAATGCT 120 

TCAAAATCAA AAGTTATGGC AGCGAATTCT TTTTGATCTA TCTTGTGCGA ACGGGTCAAA 180 

TATTCTTCGT ACATTGAGTT AATCGTACCA ATCGCCCTAA CCACATTTTC ATCAGAAAAT 24 0 

ATGGAAATAA TAGCATCCCT ATACGCACCT AGTGTAATAT TGTTTCTATT ATTAGTTATA 30 0 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 38 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AATTCCTGCG CACCTTTGCC GATGGGGAGA TAATCGCCTT TTTGCAGCAT TCTGCCCTGA 60 

TGGCCGCCGA AACCGGCTTT CAGGTCGGTA CTTCTCGAAC CCATCACTTC CGGCACATCA 120 

AATCCGCCCG CCACGCACAC ATAGCCGTAC ATGCCCTGCA CGGCACGCAC CAGTTTCAAG 18 0 

GTCTGCCCTT TGCGGGCGGT ATAACGCCAA TACGAATAGA CCGGTTCGCC GTCCAATT 23 8 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

AATTGGGCGA GATGCTGCCG GAAACGGATT TAAAACAGAT TGCGGCGGCA GTGTTGAAGA 60 

CGAACGATGA GGCGGCATTG CAGAAGGTGG TGAAAACGGC CAAAGGCAAT GCGCGGAAAC 120 

TGTCGAAGCT GCTGCTGATT GTGGACTATT TGTTGCAGGT TAACCCTGAT GTTGATTTGG 180 

ATGATGATGT AATCGAACAC GCGGAAACCT ATTTAATCCA CTAAACCTTT GACAGATAAG 240 

GCAATAATT 249 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



GCATTATTCG AATACATAAT AGCACCTCCA AATT 



334 



(ii) MOLECULE TYPE: DNA (genomic) 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

AATTTATGTA CGGTTTTGCC GTTTGCAGTC AGCCAGTCGG CAAGGCGCAG AAAAAAATCG 60 

CCGACAGGGC CTTGAAGCAG CAGGATATTT TCTGCGCTTT CAAGCAGGTT TTGCAGGTTA 120 

TTTTTGAGGA CGGTCTGTTT CATGTTGCAA TGTGGTTTTG TTTTTTATGT AATAGTTTTA 180 

GGTTGAACTT TCAAGCATAC GCCAAGAGAA TT 212 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

AATTCAGTGC CTGCGTCATA TCACGGCTAC CTTGTGGTTC AGGGTTACTG TATCGCCCGC 60 

GGCATCGACG GCTTCAATAT GCAGCTTCAG CCAGCCGTGC TGCGGGGCGG ATGCGGTTAC 120 

TTGGATGGAT TGGGCGCGTT TGGACTGAAT CACGGGCTGC AAGGCTTGCT CGGCGTACTG 180 

TTTGGCCAGT ACTTCGATGC GCTTTAAATG CTTTTGGCGG CGCAATT 227 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GATCCAGGAC TCAAAAACCG ATTTCCTAAT AGAGTGTCTA ATATCCCAAT CTTTTTTACC 60 

CCCTCTGCTG TAGAATTGAT AGAGAAAGTT TGTCTATCTT TTTCATATAC CCATGCCTTC 120 

TTTTTATCAT TGTAGCTAAC ATAACCGCCA AACAATGCTT CTAGATC 167 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
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AATTCTTGCG GCCATTTCCT GAATGGCTTT AGTCAAAACG GGGATGAACG TTTCGTATTC 



60 



GACGGTGTAG GTATCGTTTG TTTTATTTAC CATCGGCAAT CGACCATATT CATCTTCCAG 120 



CTTGATTGGA TTCGCCCACC ATTCGCGGAC TTTGTCCGCT CGTTCATCTG CCGGCAAGTC 240 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AATTCCCGAC TATCGCGGAT GCGTAGTTTT TGCCGGTGGG CAAGAGCAGG TGTGGGATAA 60 

GTTAGGTGAT TTGCCCGATG GCGTCAGCCT GACCCCGCCT GAATCGGTAA ATATTGACGG 120 

CTTAAAATCC GTAAAACTCG TCGCATTAAA TGCTGCCGCT CAGGCTTTTA TTAACAAGCA 180 

CGCCGGTATC GACAGCGTAC CTGAATT 207 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

AATTGTTTGG GAATAATCCA AACAAACAGC ATCAGGATAG CGGCGGCGGT CAGGCTGCCT 60 

GAAAGGATTT TGCCGGGGTT TTTTGTAGGC AAAGCGGACG AGAAACCAAA GCAACAGCAG 120 

CATGGTGTCC CAATAGCCGA TTGAGAATAG GATGGCCAAA CCTTCTAGGA AATGGCGTAA 180 

ATCGTTTGTG GTAACCATGG GTAGTTCCTG TGGTTAAATG TGCAGGCTGC TTTTTGCCGA 240 

ACCTTGCCGC ATCTCAAAAG CAGCCTGCGC TTCAGCGTTG CGTTACGCAG TAAAATAATG 300 

AATATTTGTA ACGGCTTGGG TATTTTTTGT CAATATTCCC GCCCTTCCCT TAACAGCTGC 360 

CGCGCTTTCC GTTAAAATT 379 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



CGCAGCAATG TCCTGGGCAA TAAACCAATG CCGCAACCGA TCTTCTTTAT GACTGCCGTC 



180 



TTTGAATAAT T 



251 



(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



AATTCGCCGA AATCAGGCTG CTGCTCGATA ATCGGCGCGG CCGATTGGCG TTGTGCCTCG 



60 



ATTAAATCCA TCTTGTCTTG CAGACGTTTG GCCTGGCCTT TGCGGCGGCG TTCGGCCAGT 



120 



TGTTCCATCC GCGTTTCCGC AAATGCCGCC CGTTTGTTGC CGTTGAATAC CGCTTTGCAA 180 
ATCACCTTGC CCTGCATATC CTTCACAATC ACATGGTCGG CATCGTGGAT GTCGTAAGCC 240 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

AATTCCGTTC TTATTGGGCT TTTTCCATCC ATCGGGTATG CCTGAAGGGA ACGCAAACCC 60 

TGCCACTTGC CCATCGCTCC ATTCCCGCAT TAGCGCGTCT GACGGCAAGT GTTCTCGCGC 120 

CCAATCAAGC CACGCCTGCC GCATTGCGGC CTTGTCCTGC TGAAAACTTC GCAGTGCTTT 180 

TGCAACCGGC CCATCATTAA CTTCAATCAA ATAAATCATT ATATTTGCGT TCATTTTTCC 240 

TACACCTTCG CCACATCCAA ATT 263 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

AATTGTTCAA GAAAAAAGTC GGCACGGCGC GGCAACGGGG AAAATGCGTT GACGCCGTCT 60 

TTTTCTAAGG TGATGTAGTA GGGGCGGAAA TAGCCTTCTT CAAACGCCCA GAAACTGGCT 120 

TGGTTTTCGT TTGCAATGCG TTTTGCAATG ACGTGATAAG GGCGTGTGTC GCCAAAGCAG 180 

ACAACGGCCT GGATGTGATG TTGAGTGATG TATTCTTGCA AAAACTCAGG AAAGGCGTCG 24 0 

TAGTTGTCGT TAAAAACAAC GGTATGCGCT TGAGTGGGCG GATAAAAATA GTCGTCGCCT 30 0 

GCATTAAAGT TGAATT 316 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



ACCCGTACCT TCTGACCGCT GTAATCCAGC AATT 



274 
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(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

AATTCAATCA ACGGAAAACA CATCAGCATC AAAAACAACG GTGGTAATGC CGACTTAAAA 60 

AACCTTAACG TCCATGCCAA AAGCGGGGCA TTGAACATTC ATTCCGACCG GGCATTGAGC 12 0 

ATAGAAAATA CCAAGCTGGA GTCTACCCAT AATACGCATC TTAATGCACA ACACGAGCGG 18 0 

GTAACGCTCA ACCAAGTAGA TGCCTACGCA CACCGTCATC TAAGCATTAC CGGCAGCCAG 24 0 

ATTTGGCAAA ACGACAAACT GCCTTCTGCC AACAAGCTGG TGGCTAACGG TGTATTGGCA 30 0 

CTCAATGCGC GCTATTCCCA AATT 324 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

AATTATGCAA AAAAACGCAA CGCCGAAAAA CTGGCACCGC GCGGATATTG TTGCTGCTTT 6 0 

GAAAAAGAAA GGCTGGTCAC TTCGAGCACT TTCAATAGAA GCGGGGTTGT CGCCGAATAC 12 0 

GCTTAGAAGC GCACTGGCCG CCCCTTATCT TAAGGGAGAA AGGATTATTG CCGCTGCAAT 18 0 

CGGAGTGGAA CCGGAAGAGA TTTGGTCCGA ACGGTATGCA GATCGGAATT 230 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 49 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

AATTTAATCG GTGGAATGCC TGTTCAACCG CACCAATCCC GCTGAATACG GTTGCTAATC 60 

TAATATGTGA ATCAGGTTTA AGAAAAGTTT TAGATTTCCA ACCTTGTTGA CTGGGAAAGA 12 0 

GCAAAGTTTT TTGTAATCGA GTATCGTGTG TCTGTGCCAT TGTCGAAATA GTCATACTTA 18 0 

TATCGTTCTG TTTATCTTAT CAATATGAAA ACTACATCGT TGATTGCCCT GACAATGCCT 24 0 

TGGTCAATT 249 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 43 base pairs 

(B) TYPE: nucleotide 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

AATTCTTGTC CCGGAGTCCA ACGTATATTT ACCCTCCTGC GAGCTAAAAG ACTATTATTC 6 0 

TCCACTGCCA CAGTAGCCGC ATTCACCGCC GTATTCACAT CCCCTTTAAC CAATGCCACT 12 0 

GCGCTGCCTG CGATAATCTG CGAGTAGGCT ATGACTTTTT GGCGTTCTTG GGGTGACAGT 18 0 

TTGCCTACAT CGCGTCCGTC CAACAGGGTT TCTCCCACCA TCTCGCCGAC TGCCGCGCCG 24 0 

ATTGCGCCGT CCCGACATTT GCCTTTATTT GCTACCGCCG ATGCACAGCC TGCTACGGCA 300 

TGGGCTATCT TGTGGGCAAT GTAGTCTTCG CTGAGATTAA ATT 343 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

AATTCTTCAA ACATCGTTTC GATAATCGGG TCGGTGTACA CACTGATGCG GTCGCCCGCA 6 0 

CGGCTTTGAC CGGCTCGGAA AATATAGGCG GTGGCTTTGC CGTCGGCGAT GTCGACGCAC 12 0 

CAACGCCAGA TGGCGTCTTC GGTATTCAAA CAATCACCCG CACAGCTTTC ACCTGCGCGG 180 

AATT 184 

(2) INFORMATION FOR SEQ ID NO : 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15620 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TATGCTCAAT CTCATTTTCA AAATGCAAAA CTTTTCTGAT TTTTCCTACT TTTTGCTCAA 6 0 

TATTAGGAAG GTTTTAGGCA ATTGAAAATT TTTTGGCGCA TTTTTATGCG TCAAATTTCG 12 0 

TTAACAGACT ATTTTTGCAA AGGTCTCCGT CTGTAAAAGC AAGGATAGGG CATCTGCCCT 18 0 

TTTGATTGTT TGATTAACGA TACAAGGAGT TTCAAAATGA GAGTTTTATA GTGGATTAAC 240 

AAAAACCAGT ACAGCGTTGC CTCGCCTTGC CGTACTATTT GTACTGTCTG CGGCTTCGTC 300 

GCCTTGTCCT GATTTAAATT TAATCCACTA TATGTGTTCA TGAAATGACT TGGGTCGGAG 360 

GCTCAGGTAA TGCGCAACAA AGTTCATATT ATTGCGAAAT TTGCGAATCT GCAGGGCTTA 420 
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ACGATACGGG AAATCCTGAT AAATCTTTAG GATTGCCAAA CAATACGTTC AGTAATCCGC 480 

CTGGTTGGGG AGCTACAATC GGAGCTTTAG CAGGTAGCCG CATAGGTATG CCTGAATTTG 540 

GTACGTTTGC GAGCCATGCC ATTGAAAATT TCGACTGGTC ATGGTATCGA CGTTATAGGG 60 0 

AAATTGCCGA AACGATTGAA CGAGAATATT CAGGCGGTTT GCCTTAATAG TTGAGGAGGT 66 0 

CATGATGTTT GCCAAACATT ATCAATTCAT CGCACTCGGC ATCATGCTGC TTCTTTATAT 72 0 

GTTGATTCTC TATACGACCG ATTTTTCCAA TCTGACGTAT TGGATGCTGT TTTTTATCTG 780 

TTTTATTACA GGAAAAATAT TAGCTCGTTT GTTAGAGAAA AGCTTTAAAT AAAATAGCAG 840 

CTAGTCGCAA AAGGTCGTCT GAAACCTTTT CAGGCGGCCT TTCTAAAATA CATCCAACTT 900 

CCTAATCCCT ATTTTTCAAA AAGGAAATCT ATGCCCCATC TGCAAAACCT GTCTTTGGGC 960 

TTAAAGAAAA AGCTGCCTGT TATCCTGCAA ACAGAAATAT CAGAATGCGG CTTGGCATGT 1020 

CTGGCGGCTG TGGCGGGATT TCATGGTTTC CATACGAATT TACGCGCACT GCGTTCAAAA 108 0 

TACTGTCCGA GACCTTTGCA AAATTCCCCA AAATCCCCTA AATGTCTTGG TGGGAATTTT 114 0 

GGGGAATTTT GCAAAGGTCT CATTCTATAA CTGTAAATAC TTTTAAATTT ATGACAAAAT 120 0 

AGTAAATATT GCTAAAATAA TATTGATGTC ATGAAATTTT TTCCTGCTCC ATGTCTGTTG 126 0 

GTTATCCTGG CTGTCATACC CCTTAAAACC TTAGCTGCCG ATGAAAACGA TGCAGAACTT 1320 

ATCCGTTCCA TGCAGCGTCA GCAGCACATA GATGCTGAAT TGTTAACTGA TGCAAATGTC 1380 

CGTTTCGAGC AACCATTGGA GAAGAACAAT TATGTCCTGA GTGAAGATGA AACACCGTGT 1440 

ACTCGGGTAA ATTACATTAG TTTAGATGAT AAGACGGCGC GCAAATTTTC TTTTCTTCCT 1500 

TCTGTGCTCA TGAAAGAAAC AGCTTTTAAA ACTGGGATGT GTTTAGGTTC CAATAATTTG 1560 

AGCAGGCTAC AAAAAGCCGC GCAACAGATA CTGATTGTGC GTGGCTACCT CACTTCCCAA 162 0 

GCTATTATCC AACCACAGAA TATGGATTCG GGAATTCTGA AATTACGGGT ATCAGCAGGC 168 0 

GAAATAGGGG ATATCCGCTA TGAAGAAAAA CGGGATGGGA AGTCTGCCGA GGGCAGTATT 174 0 

AGTGCATTCA ATAACAAATT TCCCTTATAT AGGAACAAAA TTCTCAATCT TCGCGATGTA 1800 

GAGCAGGGCT TGGAAAACCT GCGTCGTTTG CCGAGTGTTA AAACAGATAT TCAGATTATA 1860 

CCGTCCGAAG AAGAAGGCAA AAGCGATTTA CAGATCAAAT GGCAGCAGAA TAAACCCATA 1920 

CGGTTCAGTA TCGGTATAGA TGATGCGGGC GGCAAAACGA CCGGCAAATA TCAAGGAAAT 1980 

GTCGCTTTAT CGTTCGATAA CCCTTTGGGC TTAAGCGATT TGTTTTATGT TTCATATGGA 2 040 

CGCGGTTTGG TGCACAAAAC GGACTTGACT GATGCCACCG GTACGGAAAC TGAAAGCGGA 2100 

TCCAGAAGTT ACAGCGTGCA TTATTCGGTG CCCGTAAAAA AATGGCTGTT TTCTTTTAAT 216 0 

CACAATGGAC ATCGTTACCA CGAAGCAACC GAAGGCTATT CCGTCAATTA CGATTACAAC 222 0 

GGCAAACAAT ATCAGAGCAG CCTGGCCGCC GAGCGCATGC TTTGGCGTAA CAGGTTTCAT 228 0 

AAAACTTCAG TCGGAATGAA ATTATGGACA CGCCAAACCT ATAAATACAT CGACGATGCC 2 340 

GAAATCGAAG TGCAACGCCG CCGCTCTGCA GGCTGGGAAG CCGAATTGCG CCACCGTGCT 2 400 

TACCTCAACC GTTGGCAGCT TGACGGCAAG TTGTCTTACA AACGCGGGAC CGGCATGCGC 2 460 

CAAAGTATGC CCGCACCTGA AGAAAACGGC GGCGGTACTA TTCCAGGCAC ATCCCGTATG 2520 

AAAATCATAA CCGCCGGATT GGATGCAGCG GCCCCGTTTA TGTTGGGCAA ACAGCAGTTT 2580 

TTCTACGCAA CCGCCATTCA AGCTCAATGG AACAAAACGC CTTTGGTTGC CCAAGACAAG 2 640 

TTGTCTATCG GCAGCCGCTA CACCGTTCGC GGATTTGATG GGGAGCAGAG TCTTTTCGGA 2 70 0 

GAGCGAGGTT TCTACTGGCA GAATACTTTA ACTTGGTATT TTCATCCGAA CCATCAGTTC 2 76 0 

TATCTCGGTG CGGACTATGG CCGCGTATCT GGCGAAAGTG CACAATATGT ATCGGGCAAG 2 82 0 
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CAGCTGATGG GTGCAGTGGT CGGCTTCAGA GGAGGGCATA AAGTAGGCGG TATGTTTGCT 2 880 

TATGATCTGT TTGCCGGCAA GCCGCTTCAT AAACCCAAAG GCTTTCAGAC GACCAACACC 2 940 

GTTTACGGCT TCAACTTGAA TTACAGTTTC TAACCTCTGA ATTTTTTTAC TGATATTTAG 3 000 

ACGGTCTTTC CTTATCCTCA GACTGTCAAA CTTTACCTAC GTACTTGGCG CGCAGTACGT 3 060 

TCATCTTCAA AATGGAATAG ACATGAATAA AGGTTTACAT CGCATTATCT TTAGTAAAAA 3120 

GCACAGCACC ATGGTTGCAG TAGCCGAAAC TGCCAACAGC CAGGGCAAAG GTAAACAGGC 318 0 

AGGCAGTTCG GTTTCTGTTT CACTGAAAAC TTCAGGCGAC CTTTGCGGCA AACTCAAAAC 324 0 

CACCCTTAAA ACCTTGGTCT GCTCTTTGGT TTCCCTGAGT ATGGTATTGC CTGCCCATGC 3 30 0 

CCAAATTACC ACCGACAAAT CAGCACCTAA AAACCAGCAG GTCGTTATCC TTAAAACCAA 3 36 0 

CACTGGTGCC CCCTTGGTGA ATATCCAAAC TCCGAATGGA CGCGGATTGA GCCACAACCG 3 420 

CTATACGCAG TTTGATGTTG ACAACAAAGG GGCAGTGTTA AACAACGACC GTAACAATAA 3 480 

TCCGTTTCTG GTCAAAGGCA GTGCGCAATT GATTTTGAAC GAGGTACGCG GTACGGCTAG 3540 

CAAACTCAAC GGCATCGTTA CCGTAGGCGG TCAAAAGGCC GACGTGATTA TTGCCAACCC 3 600 

CAACGGCATT ACCGTTAATG GCGGCGGCTT TAAAAATGTC GGTCGGGGCA TCTTAACTAT 3 660 

CGGTGCGCCC CAAATCGGCA AAGACGGTGC ACTGACAGGA TTTGATGTGC GTCAAGGCAC 3 72 0 

ATTGACCGTA GGAGCAGCAG GTTGGAATGA TAAAGGCGGA GCCGACTACA CCGGGGTACT 3 78 0 

TGCTCGTGCA GTTGCTTTGC AGGGGAAATT ACAGGGTAAA AACCTGGCGG TTTCTACCGG 3 84 0 

TCCTCAGAAA GTAGATTACG CCAGCGGCGA AATCAGTGCA GGTACGGCAG CGGGTACGAA 3 900 

ACCGACTATT GCCCTTGATA CTGCCGCACT GGGCGGTATG TACGCCGACA GCATCACACT 3 960 

GATTGCCAAT GAAAAAGGCG TAGGCGTCAA AAATGCCGGC ACACTCGAAG CGGCCAAGCA 4 020 

ATTGATTGTG ACTTCGTCAG GCCGCATTGA AAACAGCGGC CGCATCGCCA CCACTGCCGA 4 080 

CGGCACCGAA GCTTCACCGA CTTATCTCTC CATCGAAACC ACCGAAAAAG GAGCGGCAGG 4140 

CACATTTATC TCCAATGGTG GTCGGATCGA GAGCAAAGGC TTATTGGTTA TTGAGACGGG 4200 

AGAAGATATC AGCTTGCGTA ACGGAGCCGT GGTGCAGAAT AACGGCAGTC GCCCAGCTAC 426 0 

CACGGTATTA AATGCTGGTC ATAATTTGGT GATTGAGAGT AAAACTAATG TGAACAATGC 4 32 0 

CAAAGGCTCG GCTAATCTGT CGGCCGGCGG TCGTACTACG ATCAATGATG CTACTATTCA 4 38 0 

AGCGGGCAGT TCCGTGTACA GCTCCACCAA AGGCGATACT GAATTGGGTG AAAATACCCG 4 440 

TATTATTGCT GAAAACGTAA CCGTATTATC TAACGGTAGT ATTGGCAGTG CTGCTGTAAT 4500 

TGAGGCTAAA GACACTGCAC ACATTGAATC GGGCAAACCG CTTTCTTTAG AAACCTCGAC 4560 

CGTTGCCTCC AACATCCGTT TGAACAACGG TAACATTAAA GGCGGAAAGC AGCTTGCTTT 4 620 

ACTGGCAGAC GATAACATTA CTGCCAAAAC TACCAATCTG AATACTCCCG GCAATCTGTA 4 680 

TGTTCATACA GGTAAAGATC TGAATTTGAA TGTTGATAAA GATTTGTCTG CCGCCAGCAT 4 740 

CCATTTGAAA TCGGATAACG CTGCCCATAT TACCGGCACC AGTAAAACCC TCACTGCCTC 4 80 0 

AAAAGACATG GGTGTGGAGG CAGGCTTGCT GAATGTTACC AATACCAATC TGCGTACCAA 4 86 0 

CTCGGGTAAT CTGCACATTC AGGCAGCCAA AGGCAATATT CAGCTTCGCA ATACCAAGCT 4 92 0 

GAACGCAGCC AAGGCTCTCG AAACCACCGC ATTGCAGGGC AATATCGTTT CAGACGGCCT 4 980 

TCATGCTGTT TCTGCAGACG GTCATGTATC CTTATTGGCC AACGGTAATG CCGACTTTAC 5 040 

CGGTCACAAT ACCCTGACAG CCAAGGCCGA TGTCAATGCA GGATCGGTTG GTAAAGGCCG 5100 

TCTGAAAGCA GACAATACCA ATATCACTTC ATCTTCAGGA GATATTACGT TGGTTGCCGG 5160 
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CAACGGTATT CAGCTTGGTG ACGGAAAACA ACGCAATTCA ATCAACGGAA AACACATCAG 5220 

CATCAAAAAC AACGGTGGTA ATGCCGACTT AAAAAACCTT AACGTCCATG CCAAAAGCGG 528 0 

GGCATTGAAC ATTCATTCCG ACCGGGCATT GAGCATAGAA AATACCAAGC TGGAGTCTAC 5 34 0 

CCATAATACG CATCTTAATG CACAACACGA GCGGGTAACG CTCAACCAAG TAGATGCCTA 5 40 0 

CGCACACCGT CATCTAAGCA TTACCGGCAG CCAGATTTGG CAAAACGACA AACTGCCTTC 5 46 0 

TGCCAACAAG CTGGTGGCTA ACGGTGTATT GGCACTCAAT GCGCGCTATT CCCAAATTGC 5520 

CGACAACACC ACGCTGAGAG CGGGTGCAAT CAACCTTACT GCCGGTACCG CCCTAGTCAA 5580 

GCGCGGCAAC ATCAATTGGA GTACCGTTTC GACCAAGACT TTGGAAGATA ATGCCGAATT 5 640 

AAAACCATTG GCCGGACGGC TGAATATTGA AGCAGGTAGC GGCACATTAA CCATCGAACC 5 700 

TGCCAACCGC ATCAGTGCGC ATACCGACCT GAGCATCAAA ACAGGCGGAA AATTGCTGTT 5 760 

GTCTGCAAAA GGAGGAAATG CAGGTGCGCC TAGTGCTCAA GTTTCCTCAT TGGAAGCAAA 5 82 0 

AGGCAATATC CGTCTGGTTA CAGGAGAAAC AGATTTAAGA GGTTCTAAAA TTACAGCCGG 5 88 0 

TAAAAACTTG GTTGTCGCCA CCACCAAAGG CAAGTTGAAT ATCGAAGCCG TAAACAACTC 5 94 0 

ATTCAGCAAT TATTTTCCTA CACAAAAAGC GGCTGAACTC AACCAAAAAT CCAAAGAATT 6 00 0 

GGAACAGCAG ATTGCGCAGT TGAAAAAAAG CTCGCCTAAA AGCAAGCTGA TTCCAACCCT 6 060 

GCAAGAAGAA CGCGACCGTC TCGCTTTCTA TATTCAAGCC ATCAACAAGG AAGTTAAAGG 6120 

TAAAAAACCC AAAGGCAAAG AATACCTGCA AGCCAAGCTT TCTGCACAAA ATATTGACTT 6180 

GATTTCCGCA CAAGGCATCG AAATCAGCGG TTCCGATATT ACCGCTTCCA AAAAACTGAA 6240 

CCTTCACGCC GCAGGCGTAT TGCCAAAGGC AGCAGATTCA GAGGCGGCTG CTATTCTGAT 6 300 

TGACGGCATA ACCGACCAAT ATGAAATTGG CAAGCCCACC TACAAGAGTC ACTACGACAA 6 36 0 

AGCTGCTCTG AACAAGCCTT CACGTTTGAC CGGACGTACG GGGGTAAGTA TTCATGCAGC 6 42 0 

TGCGGCACTC GATGATGCAC GTATTATTAT CGGTGCATCC GAAATCAAAG CTCCCTCAGG 6 48 0 

CAGCATAGAC ATCAAAGCCC ATAGTGATAT TGTACTGGAG GCTGGACAAA ACGATGCCTA 6540 

TACCTTCTTA AAAACCAAAG GTAAAAGCGG CAAAATCATC AGAAAAACCA AGTTTACCAG 6 600 

CACCCGCGAC CACCTGATTA TGCCAGCCCC CGTCGAGCTG ACCGCCAACG GTATCACGCT 6 660 

TCAGGCAGGC GGCAACATCG AAGCTAATAC CACCCGCTTC AATGCCCCTG CAGGTAAAGT 6 720 

TACCCTGGTT GCGGGTGAAG AGCTGCAACT GCTGGCAGAA GAAGGCATCC ACAAGCACGA 6 780 

GTTGGATGTC CAAAAAAGCC GCCGCTTTAT CGGCATCAAG GTAGGTAAGA GCAATTACAG 6 840 

TAAAAACGAA CTGAACGAAA CCAAATTGCC TGTCCGCGTC GTCGCCCAAA CTGCAGCCAC 6 90 0 

CCGTTCAGGC TGGGATACCG TGCTCGAAGG TACCGAATTC AAAACCACGC TGGCCGGTGC 6 96 0 

CGACATTCAG GCAGGTGTAG GCGAAAAAGC CCGTGTCGAT GCGAAAATTA TCCTCAAAGG 7 02 0 

CATTGTGAAC CGTATCCAGT CGGAAGAAAA ATTAGAAACC AACTCAACCG TATGGCAGAA 7 080 

ACAGGCCGGA CGCGGCAGCA CTATCGAAAC GCTAAAACTG CCCAGCTTCG AAAGCCCTAC 7140 

TCCGCCCAAA TTGTCCGCAC CCGGCGGCTA TATCGTCGAC ATTCCGAAAG GCAATCTGAA 7200 

AACCGAAATC GAAAAGCTGT CCAAACAGCC CGAGTATGCC TATCTGAAAC AGCTCCAAGT 7260 

AGCGAAAl^C ATCAACTGGA ATCAGGTGCA GCTTGCTTAC GACAGATGGG ACTACAAACA 7 320 

GGAGGGCTTA ACCGAAGCAG GTGCGGCGAT TATCGCACTG GCCGTTACCG TGGTCACCTC 7 380 

AGGCGCAGGA ACCGGAGCCG TATTGGGATT AAACGGTGCG GCCGCCGCCG CAACCGATGC 7 44 0 

AGCATTCGCC TCTTTGGCCA GCCAGGCTTC CGTATCGTTC ATCAACAACA AAGGCGATGT 750 0 

CGGCAAAACC CTGAAAGAGC TGGGCAGAAG CAGCACGGTG AAAAATCTGG TGGTTGCCGC 756 0 
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CGCTACCGCA GGCGTAGCCG ACAAAATCGG CGCTTCGGCA CTGAACAATG TCAGCGATAA 7 620 

GCAGTGGATC AACAACCTGA CCGTCAACCT AGCCAATGCG GGCAGTGCCG CACTGATTAA 7 680 

TACCGCTGTC AACGGCGGCA GCCTGAAAGA CAATCTGGAA GCGAATATCC TTGCGGCTTT 7 740 

GGTCAATACC GCGCATGGAG AAGCAGCCAG TAAAATCAAA CAGTTGGATC AGCACTACAT 7 800 

AGTCCACAAG ATTGCCCATG CCATAGCGGG CTGTGCGGCA GCGGCGGCGA ATAAGGGCAA 7 860 

GTGTCAGGAT GGTGCGATAG GTGCGGCTGT GGGCGAGATA GTCGGGGAGG CTTTGACAAA 7 92 0 

CGGCAAAAAT CCTGACACTT TGACAGCTAA AGAACGCGAA CAGATTTTGG CATACAGCAA 7 98 0 

ACTGGTTGCC GGTACGGTAA GCGGTGTGGT CGGCGGCGAT GTAAATGCGG CGGCGAATGC 8 04 0 

GGCTGAGGTA GCGGTGAAAA ATAATCAGCT TAGCGACAAA GAGGGTAGAG AATTTGATAA 810 0 

CGAAATGACT GCATGCGCCA AACAGAATAA TCCTCAACTG TGCAGAAAAA ATACTGTAAA 8160 

AAAGTATCAA AATGTTGCTG ATAAAAGACT TGCTGCTTCG ATTGCAATAT GTACGGATAT 8220 

ATCCCGTAGT ACTGAATGTA GAACAATCAG AAAACAACAT TTGATCGATA GTAGAAGCCT 8280 

TCATTCATCT TGGGAAGCAG GTCTAATTGG TAAAGATGAT GAATGGTATA AATTATTCAG 8 340 

CAAATCTTAC ACCCAAGCAG ATTTGGCTTT ACAGTCTTAT CATTTGAATA CTGCTGCTAA 8 400 

ATCTTGGCTT CAATCGGGCA ATACAAAGCC TTTATCCGAA TGGATGTCCG ACCAAGGTTA 8 46 0 

TACACTTATT TCAGGAGTTA ATCCTAGATT CATTCCAATA CCAAGAGGGT TTGTAAAACA 8 52 0 

AAATACACCT ATTACTAATG TCAAATACCC GGAAGGCATC AGTTTCGATA CAAACCTAAA 858 0 

AAGACATCTG GCAAATGCTG ATGGTTTTAG TCAAGAACAG GGCATTAAAG GAGCCCATAA 8 640 

CCGCACCAAT TTTATGGCAG AACTAAATTC ACGAGGAGGA CGCGTAAAAT CTGAAACCCA 8 700 

AACTGATATT GAAGGCATTA CCCGAATTAA ATATGAGATT CCTACACTAG ACAGGACAGG 8 760 

TAAACCTGAT GGTGGATTTA AGGAAATTTC AAGTATAAAA ACTGTTTATA ATCCTAAAAA 8 820 

ATTTTCTGAT GATAAAATAC TTCAAATGGC TCAAAATGCT GCTTCACAAG GATATTCAAA 8 880 

AGCCTCTAAA ATTGCTCAAA ATGAAAGAAC TAAATCAATA TCGGAAAGAA AAAATGTCAT 8 940 

TCAATTCTCA GAAACCTTTG ACGGAATCAA ATTTAGATCA TATTTTGATG TAAATACAGG 9 00 0 

AAGAATTACA AACATTCACC CAGAATAATT TAAAGGAAAA ATTATGAAAA ATAATATTTT 9 06 0 

TCTAAACTTA AATAAAAAAT CTATAAATAA CAACCATTTT GTTATTTCGA TTTTTTTTGA 912 0 

AACAATTTAC CAATTTGAAA CTAAAGATAC GCTTTTAGAG TGTTTTAAAA ATATTACAAC 9180 

TACCGGACAT TTTGGAGTAA TAGGTGCTCA ATATGAAAAA ATAGATGCTA CCAGATGGAT 9240 

TGGAGATTAT GAAGAGGTAA ATGGATTTGA GTATATTGAT AAAGCTCCTT CTATTTATTT 9 300 

TTCAGTTGGA GATGATTTCA ATCCTGAAGA ATTAATTATA CCTATTAATT TAGCATATCA 9 360 

TTACTTTAAT ATTGCAATAT CTGATTTCTT AATAGCTCAC CCTGAATATC AAAAAAAGTG 9 420 

TAAAGAAATA CAAAAAACAT ATTCTCAAAC AAACTGTAGC CTGCATGAAA CCTAAAATCC 9 480 

ATGCGTAAGG TGTGTGCTTC AGCACGCACG CGTTCCATGA TTTACGGCTC AATGCCGTCT 954 0 

GAAAAGCTCA CAATTTTTCA GACGGCATTT GTTATGCAAG TAAATATTCA GATTCCCTAT 9 60 0 

ATACTGCCCA GACGCGTGCG TGCTGAAGAC ACCCCCTACG CTTGCTGCAG AACTTTCGGG 9 66 0 

TAAAACCGGT GTGAGCATTA GCGCACCGTA TGCCAATGAG AACAGTCGCA TCCTGCTCAG 9 720 

CACCACGGAT ATCAGTTCGG AAAACGGCAA AATCAAAATT CAATCTTACG GTGACCAATA 9 780 

TTACTATGCG AGACAGAGCG AACTCTATAC CTTTGAACGC CGCAGCTACA AAACTGGCAA 9 840 

ATGGTACAAC CGCAAACACA TTACCGAAGT CAAAGAACAC AAAAACGCCA AGCCCGACGC 9 900 
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AGTAAACCTC AGCGCATCCC AAGGCATCGA CATCAAATCT GGTGGCAGCA TCGACGCCTA 9 960 

CGCCACCGCA TTCGATGCCC CCAAAGGCAG CATTAACATC GAAGCCGGGC GGAAATTGAC 10 020 

ACTCTATGCC GTAGAAGAGC TCAACTACGA CAAACTAGAC AGCCAAAAAA GGCGCAGATT 10 08 0 

TCTCGGCATC AGCTACAGCA AAGCACACGA CACCACCACC CAAGTCATGA AAACCGCGCT 1014 0 

GCCCTCAAGG GTAGTTGCAG AATCAGCCAA CCTCCAATCG GGCTGGGATA CCAAACTGCA 1020 0 

AGGCACACAG TTTGAAACCA CACTGGGTGG CGCAACCATA CGCGCAGGCG TAGGTGAGCA 10260 

GGCACGGGCA GATGCCAAGA TTATCCTCGA AGGGATCAAA AGCAGCATCC ACACAGAAAC 10 320 

CGTGAGCAGC AGCAAATCTA CTCTATGGCA AAAACAGGCA GGACGGGGCA GTAACATCGA 10 380 

AACCTTGCAA TTGCCGAGTT TCACCGGTCC CGTTGCGCCC GTACTGTCCG CACCCGGCGG 10 440 

TTACATTGTC GACATTCCGA AAGGCAATCT GAAAACCCAA ATCGAAACCC TCACCAAGCA 10500 

GCCCGAGTAT GCTTATTTGA AACAACTTCA AGTTGCGAAA AACATCAACT GGAATCAGGT 1056 0 

GCAGCTTGCT TACGATAAAT GGGACTACAA ACAGGAGGGC ATGACACCCG CAGCAGCAGC 10 620 

TGTCGTCGTT ATCGTCGTAA CCGTATTGAC CTACGGTGCA CTGTCCGCCC CGGCAGCCGC 10 68 0 

CGGAACGGCG GGCGCGGCAG GCGCAGGAGC GGGAGGAGCC GCAGCAGGAA CGGCAGCCGG 10 74 0 

AACTGGAGTA GCAGCAGGAA CGGCAGCCAC AACCGGAGTA GCAGCAGGCA CATCAGCTGC 10 800 

AGCTATCACC ACAGCCGCAG GCAAAGCCGC ACTGGCCAGT CTCGCCAGCC AAGCCGCAGT 10 860 

TTCCCTCATC AACAACAAAG GAGACATAAA CCATACCCTG AAAGAACTGG GCAAAAGCAG 10 920 

CACCGTCAGA CAGGCCGCCA CCGCCGCCGT AACCGCAGGC GTACTGCAGG GCATAAGCGG 10 980 

GCTGAACACC CAAGCAGCCG AAGCCGTCAG CAAACATTTT CACAGTCCCG CAGCAGGCAA 11040 

ACTGACCGCT AACCTGATCA ACAGCACCGC TGCCGCAAGT GTCCATACCG CCATCAACGG 1110 0 

CGGCAGCCTG AAAGACAACT TGGGCGATGC CGCACTGGGT GCGATAGTCA GTACCGTACA 1116 0 

CGGAGAAGTA GCGAGCAAAA TCAAATTTAA TCTCAGCGAA GACTACATTG CCCACAAGAT 11220 

AGCCCATGCC GTAGCAGGCT GTGCATCGGC GGTAGCAAAT AAAGGCAAAT GTCGGGACGG 11280 

CGCAATCGGC GCGGCAGTCG GCGAGATGGT GGGAGAAACC CTGTTGGACG GACGCGATGT 11340 

AGGCAAACTG TCACCCCAAG AACGCCAAAA AGTCATAGCC TACTCGCAGA TTATCGCAGG 11400 

CAGCGCAGTG GCATTGGTTA AAGGGGATGT GAATACGGCG GTGAATGCGG CTACTGTGGC 11460 

AGTGGAGAAT AATAGTCTTT TAGCTCGCAG GAGGGTAAAT ATACGTTGGA CTCCGCGACA 11520 

AGAATTGGAA CATGAATATG CCATTCTTGA AATCCAGGCC ATTACCAATC AAATCCGAAG 11580 

GCTGGATCCG AAATTTAACG GGATTGCTAT TCTGAGGACT CCTGGAGAGC CGTGGACAAG 11640 

ACATGATGTA CAAACATACA GGCAATATTA TAATCAATTA AGGGAATCCA GAGGCTTTGC 11700 

TGTTGAACCA ATTTATAGAA TCAGGATAAA CAACGGCAAT GAATTTAACC GTATCATGTC 11760 

ATCAAAATAC CCTTATAATG AGCTTTATGT AGCCAATCCT AAATCGGCGA CGGGGTATTT 11820 

TAGGGTAGAT TCGTATGATC CTGCGACAAG GGAAATTATT TCAAGAAAAT TTACCCAATT 11880 

TTCTCAAATC CAAGAAAGTA CGGGGATTGG TTATATCAAG GAGGCTGTTA GAAAATATAG 11940 

CCCTGGTACT GTCATTTCCA ATGTTCCJy^G TACACCTACT ACGATAAGAG GAAGAAAGCT 12 000 

TGAAGGAl^ CTTATTTTAG AAGTTCCTGC TCAGGTCAAT CCAATTCCAC AATCTGTATT 12 060 

AAGGGCGGCA CAAGAAGAAA ATGTTATCAT TAGAGATACA ACAGGAAGGA TTTACAAATG 12120 

AAGAAAGATA TTTTTTATTG TGAGCAGTGG TCTTATGGTT ATAAGAGACT TCATAAGCCT 1218 0 

TTTTCTGAGA AACAAGCTGA GGAAAAACAT CTTAAAGGGG AGTTATATAC TGCCGTAATA 1224 0 

GGTTCGGCGA CACAACCTGA ATATGTAATT ACCTTGCGAG AGGAAGTAGG TTTTTTTTCG 12 30 0 
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GTAAATTTTT TCGATAAATT TGGAAGGGAT TATTTAACCC ATCAATTTCA AAAATATTCC 12 360 

AATTCGAATT ATTATTTTCT TTCTATGGCT GTATGGAGAG ATTATATAAC TTTGGAATCT 12 420 

CATGACTTAG CAGAAGGATA TACTTATTTC TTCAATGAAA ATACGGATGA TTGCTATGTT 12 480 

TTGAAACAAG ATTTTATTAA TAATGAGCGA TATGAAAAAA CAGAATTATA TTCCCAAAAA 12540 

GATAAGGTAA TTCTATTTCC AAAGTTTGGT GAATATGATT TGGTGTTAAA TCCGGACATT 12 600 

ATTTAATTAA GTTTTAAGGC CGTCTGAAAA AAATTTCAAA CGGCTTTTAT TATTGGGTTT 12 66 0 

GGAATCTGAG GATAAAGCTG ATAAAAACCA GGAAATTATC AGATTGCTAT ATACGTATTG 12 72 0 

TTGTACAGAC TAAAGGCAGC AATCAAATCA CTATTGCTTA CCCACAAAAA TAAATTGATT 12 78 0 

ATATGGAATA ATCATGAATA AGAGAATGAA AATGTGTCCT GCTTGTCAAC AAGGCTATCT 12 84 0 

CTACCATTCG AAACCTAAAT ATCTTCATGA TGAAATTATT CTGTGTGATG AATGCGATGC 12 900 

AGTATGGCTC AAAGGTATGA ATATATTTTA TGGAGAATAT GAAAAAGATT TTTATTCTTA 12 960 

TGTTCCTTTC ATGGAATCCC AAGGTATAAC GAGTGAATGT ATTTGGGAAG GAGATTTGTT 13 020 

TGATCATCCA TATTATGAAG ATGAAAACTC AAATGATATG GATTGATGGA AATTTTAAGC 13 080 

CTGCGTAGGT ACGATTAGCC ATCAAACGGC GTAATCATAC GCAAGATTAT CAACAGAGAG 13140 

GGCTGGCAGC GATATACCAC CCACAAGATT GCCCATGCCA TAGCGGGCTG TGCGGCAGCG 13200 

GCGGCGAATA AGGGCAAGTG TCAGGATGGT GCGATAGGCG CTGCAGTCGG TGAGATTGTT 1326 0 

GGTGAGGCTT TGGTTAAGAA TACTGATTTC AGTCGTATGA GTGCGACCGA AATCGAAAAA 13 320 

GCTAAAGCGA AGATTACTGC CTATTCAAAA CTGGTTGCCG GCACTGCGTC TGCCGTTGTA 13380 

GGCGGGGATG TGAATACAGC GGCGAATGCG GCACAGATAG CGGTGGAGAA TAATACTTTG 13 440 

TATCCTAGAT GCGTTGGTGC AAAGTGTGAT GAATTTCAAA AGGAACAACA AAAATGGATA 13500 

CGTGAAAATC CTGAAGAATA TCGAGAAGTT TTGCTTTTTC AGACAGGATT TATTCCAATT 13560 

ATCGGTGATA TACAGAGTTT TGTACAAGCA CAGACCGCTG CCGATCACCT GTTTGCTTTG 13 620 

CTGGGTGTGG TTCCGGGTAT CGGTGAATCG ATACAGGCCT ATAAAGTAGC GAAAGCGGCA 13 680 

AAAAATTTAC AAGGCATGAA AAAAGCCTTG GACAAGGCAG CAACCGTTGC CACTGCACAG 13 74 0 

GGCTATGTCA GCAAAACCAA AATCAAAATC GGTCAAACTG AATTAAGGGT TACTGCAGCA 13 800 

ACTGACAAAC AATTGCTGAA AGCTATTGGC GAAGGAAGGG ACACGACAGG TAAAATGACC 13 86 0 

GAGCAGTTAT TTGACTCTTT AGCTAAACAA AATGGCTTCA GAGTGCTTTC GGGCGGCAAA 13920 

TACGGCGGAA ATAACGGTTT TGATCATGTA TGGCAGGCTG CCGATGGTAG TGTCGTTTTG 13 980 

ATTGTAGAAA GTAAGCAGAT TAGGAACGGT ACGGTACAGC TGAATCCGAA TGGTGCGGGT 14 040 

GGATATACGC AAATGAGTGA GGATTGGATT AGACAAGTTT TAGATCAATT ACCCGATGGT 14100 

AGTCCCGCTA AAGCTGCTGT CTTCAAAGCA AATAAGAACG GCACATTAAA AACAGCAATA 14160 

GCAGGCGTTG ATCGTCAAAC AGGTAAGGCC GTTATTCTTC CTGTCAAAGT TCCTTCTAAA 14220 

ACCAATATAA GGAGATAACA ATGGGGCACA ATATGATGAC CACCCAAAAA TGGTATGAGC 1428 0 

ATATTACTAA TGTAATCATA GGCAATACTG CTAATTTCAA TAGCGGTTGC CTTGACTCTA 14 340 

TAGATTATGT AGATGAAAGA AAAGGCGTTC CGCTTGCAGC TATGCAACAT ATTTTCATGG 14 40 0 

ACGTTAGAGC TGCAGCTTCC CATGCCTATC TATTTGAACA TGATCTTAAG AAATTCAAGC 14 460 

AATATGCTTA TGTTGCAGGA AAGCTGGGGG TTTTGCTGAG TGTAAATTCT ACAGACCCTG 14520 

AACCCTTCTT CTTTCCCTGT GACATGCTCA ACATTCAAAA TCCGATGTTT CTGATGCTGA 14580 

TGAGCGACAG CCCACAGCTG CGTGAGTTTC TGGTGCGCAA TATCGACAAC ATCGCCAACG 14 640 
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ATACAGAAGC CTTTATAAAC CGCTACGACC TCAACCGGCA TATGATTTAC AATACTCTGC 14 700 

TGATGGTGGA GGGTAAGCAG CTTGATCGGT TGAAACAACG TAGCGAGAAA GTCTTGGCGC 14 76 0 

ATCCCACCCC TAGCAAATGG CTGCAAAAGC GGTTGTACGA TTACCGCTTC TTCCTCGCTT 14 820 

TCGCCGAACA GGATGCCGAG GCAATGAAAG CCGCCTTAGA GCCGCTTTTC GATAAAAAAA 14 880 

CCGCGCGTAT GGCTGCCAAA GAAACATTGT CCTATTTCGA TTTCTACCTG CAGCCGCAAA 14 94 0 

TCGTTACCTA CGCCAAAATC GCATCCATGC ACGGTTTCGA TTTGGGCATA GATCAAGAAA 15 000 

TCTCACCGAG GGATTTGATT GTTTACGATC CGCTGCCGGC AGACGAATAT CAAGACATCT 15 060 

TCGATTTTAT GAAACAGTAT GACTTGTCTT ACCCGTATGA ATATCTGCAG GATTGGATAG 15120 

ATTACTATAC GTTCAAAACC GATAAGCTGG TATTTGGTAA CGCGAAGCGA GAGTGAGCCG 15180 

TAAAACTCTG AGCTCCTGTT TTATAGATTA CAACTTTAGG CCGTCTTAAA GCTGAAAGAT 15240 

TTTCGAAAGC TATAAATTGA AGCCCTTCCA CAGTACATAG ATCTGTGTTG TGGCGGGGCT 15 30 0 

TTACCACGCT GATTGCCGGA GAAGAACTCA ACCTGCTGGC AAAACAAGGC ATGAGATCTT 15 36 0 

TGCAATAACA TGAGTTGAGA CCTTTGCAAA AAAGCCCTTC CCCGACATCC GAAACCCAAA 15 42 0 

CACAGGATTT CGGCTGTTTT CGTACCAAAT ACCTCCTAAT TTTACCCAAA TACCCCCTTA 15 48 0 

ATCCTCCTCG GACACCCGAT AATCAGGCAT CCGGGCTGCC TTTTAGGCGG CAGCGGGCGC 15540 

ATTTAGCCTG TTGGCCGCTT TCAACAGGTT CAAACACATC GCCTTCAGGT GGCTTTGCGC 15 600 

ACTCACTTTG TCATTTCCAA 15 620 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 acides amin,s 

(B) TYPE: acide amin, 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . .580 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Lys Phe Phe Pro Ala Pro Cys Leu Leu Val lie 
15 10 



Leu Ala Val He 
15 



Pro Leu Lys Thr Leu Ala Ala Asp Glu Asn Asp Ala 
20 25 



Glu Leu He Arg 
30 



Ser Met Gin Arg Gin Gin His He Asp Ala Glu Leu 

35 40 



Leu Thr Asp Ala 

45 



Asn Val Arg Phe Glu Gin Pro Leu Glu Lys Asn Asn Tyr Val Leu Ser 
50 55 60 



Glu Asp Glu Thr Pro Cys Thr Arg Val Asn Tyr He 
65 70 75 



Ser Leu Asp Asp 
80 



Lys Thr Ala Arg Lys Phe Ser Phe Leu Pro Ser Val 

85 90 



Leu Met Lys Glu 

95 



Thr Ala Phe Lys Thr Gly Met Cys Leu Gly Ser Asn 
100 105 



Asn Leu Ser Arg 
110 



Leu Gin Lys Ala Ala Gin Gin He Leu He Val Arg 
115 120 



Gly Tyr Leu Thr 
125 



Ser Gin Ala He He Gin Pro Gin Asn Met Asp Ser Gly He Leu Lys 

130 135 140 



Leu Arg Val Ser Ala Gly Glu He Gly Asp He Arg Tyr Glu Glu Lys 
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145 



150 



155 



160 



Arg Asp Gly Lys Ser Ala Glu Gly Ser lie Ser Ala Phe Asn Asn Lys 
165 170 175 

Phe Pro Leu Tyr Arg Asn Lys lie Leu Asn Leu Arg Asp Val Glu Gin 

180 185 190 

Gly Leu Glu Asn Leu Arg Arg Leu Pro Ser Val Lys Thr Asp lie Gin 
195 200 205 

lie lie Pro Ser Glu Glu Glu Gly Lys Ser Asp Leu Gin lie Lys Trp 
210 215 220 

Gin Gin Asn Lys Pro lie Arg Phe Ser lie Gly lie Asp Asp Ala Gly 

225 230 235 240 

Gly Lys Thr Thr Gly Lys Tyr Gin Gly Asn Val Ala Leu Ser Phe Asp 
245 250 255 

Asn Pro Leu Gly Leu Ser Asp Leu Phe Tyr Val Ser Tyr Gly Arg Gly 
260 265 270 

Leu Val His Lys Thr Asp Leu Thr Asp Ala Thr Gly Thr Glu Thr Glu 

275 280 285 

Ser Gly Ser Arg Ser Tyr Ser Val His Tyr Ser Val Pro Val Lys Lys 
290 295 300 

Trp Leu Phe Ser Phe Asn His Asn Gly His Arg Tyr His Glu Ala Thr 
305 310 315 320 

Glu Gly Tyr Ser Val Asn Tyr Asp Tyr Asn Gly Lys Gin Tyr Gin Ser 
325 330 335 

Ser Leu Ala Ala Glu Arg Met Leu Trp Arg Asn Arg Phe His Lys Thr 
340 345 350 

Ser Val Gly Met Lys Leu Trp Thr Arg Gin Thr Tyr Lys Tyr lie Asp 
355 360 365 

Asp Ala Glu lie Glu Val Gin Arg Arg Arg Ser Ala Gly Trp Glu Ala 
370 375 380 

Glu Leu Arg His Arg Ala Tyr Leu Asn Arg Trp Gin Leu Asp Gly Lys 
385 390 395 400 

Leu Ser Tyr Lys Arg Gly Thr Gly Met Arg Gin Ser Met Pro Ala Pro 
405 410 415 

Glu Glu Asn Gly Gly Gly Thr lie Pro Gly Thr Ser Arg Met Lys lie 
420 425 430 

lie Thr Ala Gly Leu Asp Ala Ala Ala Pro Phe Met Leu Gly Lys Gin 
435 440 445 

Gin Phe Phe Tyr Ala Thr Ala lie Gin Ala Gin Trp Asn Lys Thr Pro 
450 455 460 

Leu Val Ala Gin Asp Lys Leu Ser lie Gly Ser Arg Tyr Thr Val Arg 

465 470 475 480 

Gly Phe Asp Gly Glu Gin Ser Leu Phe Gly Glu Arg Gly Phe Tyr Trp 
485 490 495 

Gin Asn Thr Leu Thr Trp Tyr Phe His Pro Asn His Gin Phe Tyr Leu 
500 505 510 

Gly Ala Asp Tyr Gly Arg Val Ser Gly Glu Ser Ala Gin Tyr Val Ser 
515 520 525 

Gly Lys Gin Leu Met Gly Ala Val Val Gly Phe Arg Gly Gly His Lys 
530 535 540 

Val Gly Gly Met Phe Ala Tyr Asp Leu Phe Ala Gly Lys Pro Leu His 
545 550 555 560 

Lys Pro Lys Gly Phe Gin Thr Thr Asn Thr Val Tyr Gly Phe Asn Leu 



565 



570 



575 
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Asn Tyr Ser Phe 
580 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 acides amin,s 

(B) TYPE: acide amin, 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .1981 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Asn Lys Gly Leu His Arg lie lie Phe Ser Lys Lys His Ser Thr 

15 10 15 

Met Val Ala Val Ala Glu Thr Ala Asn Ser Gin Gly Lys Gly Lys Gin 
20 25 30 

Ala Gly Ser Ser Val Ser Val Ser Leu Lys Thr Ser Gly Asp Leu Cys 
35 40 45 

Gly Lys Leu Lys Thr Thr Leu Lys Thr Leu Val Cys Ser Leu Val Ser 

50 55 60 

Leu Ser Met Val Leu Pro Ala His Ala Gin lie Thr Thr Asp Lys Ser 
65 70 75 80 

Ala Pro Lys Asn Gin Gin Val Val lie Leu Lys Thr Asn Thr Gly Ala 
85 90 95 

Pro Leu Val Asn lie Gin Thr Pro Asn Gly Arg Gly Leu Ser His Asn 



Arg Tyr Thr Gin Phe Asp Val Asp Asn Lys Gly Ala Val Leu Asn Asn 
115 120 125 

Asp Arg Asn Asn Asn Pro Phe Leu Val Lys Gly Ser Ala Gin Leu lie 
130 135 140 

Leu Asn Glu Val Arg Gly Thr Ala Ser Lys Leu Asn Gly lie Val Thr 

145 150 155 160 

Val Gly Gly Gin Lys Ala Asp Val lie lie Ala Asn Pro Asn Gly lie 
165 170 175 

Thr Val Asn Gly Gly Gly Phe Lys Asn Val Gly Arg Gly lie Leu Thr 
180 185 190 

lie Gly Ala Pro Gin lie Gly Lys Asp Gly Ala Leu Thr Gly Phe Asp 

195 200 205 

Val Arg Gin Gly Thr Leu Thr Val Gly Ala Ala Gly Trp Asn Asp Lys 
210 215 220 

Gly Gly Ala Asp Tyr Thr Gly Val Leu Ala Arg Ala Val Ala Leu Gin 
225 230 235 240 

Gly Lys Leu Gin Gly Lys Asn Leu Ala Val Ser Thr Gly Pro Gin Lys 

245 250 255 

Val Asp Tyr Ala Ser Gly Glu lie Ser Ala Gly Thr Ala Ala Gly Thr 
260 265 270 

Lys Pro Thr lie Ala Leu Asp Thr Ala Ala Leu Gly Gly Met Tyr Ala 

275 280 285 

Asp Ser lie Thr Leu lie Ala Asn Glu Lys Gly Val Gly Val Lys Asn 

290 295 300 



100 



105 



110 



Ala Gly Thr Leu Glu Ala Ala Lys Gin Leu lie Val Thr Ser Ser Gly 
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305 



310 



315 



320 



Arg lie Glu Asn Ser Gly Arg lie Ala Thr Thr Ala Asp Gly Thr Glu 
325 330 335 

Ala Ser Pro Thr Tyr Leu Ser lie Glu Thr Thr Glu Lys Gly Ala Ala 

340 345 350 

Gly Thr Phe lie Ser Asn Gly Gly Arg lie Glu Ser Lys Gly Leu Leu 
355 360 365 

Val He Glu Thr Gly Glu Asp He Ser Leu Arg Asn Gly Ala Val Val 
370 375 380 

Gin Asn Asn Gly Ser Arg Pro Ala Thr Thr Val Leu Asn Ala Gly His 

385 390 395 400 

Asn Leu Val He Glu Ser Lys Thr Asn Val Asn Asn Ala Lys Gly Ser 
405 410 415 

Ala Asn Leu Ser Ala Gly Gly Arg Thr Thr He Asn Asp Ala Thr He 
420 425 430 

Gin Ala Gly Ser Ser Val Tyr Ser Ser Thr Lys Gly Asp Thr Glu Leu 

435 440 445 

Gly Glu Asn Thr Arg He He Ala Glu Asn Val Thr Val Leu Ser Asn 
450 455 460 

Gly Ser He Gly Ser Ala Ala Val He Glu Ala Lys Asp Thr Ala His 
465 470 475 480 

He Glu Ser Gly Lys Pro Leu Ser Leu Glu Thr Ser Thr Val Ala Ser 
485 490 495 

Asn He Arg Leu Asn Asn Gly Asn He Lys Gly Gly Lys Gin Leu Ala 
500 505 510 

Leu Leu Ala Asp Asp Asn He Thr Ala Lys Thr Thr Asn Leu Asn Thr 
515 520 525 

Pro Gly Asn Leu Tyr Val His Thr Gly Lys Asp Leu Asn Leu Asn Val 
530 535 540 

Asp Lys Asp Leu Ser Ala Ala Ser He His Leu Lys Ser Asp Asn Ala 
545 550 555 560 

Ala His He Thr Gly Thr Ser Lys Thr Leu Thr Ala Ser Lys Asp Met 
565 570 575 

Gly Val Glu Ala Gly Leu Leu Asn Val Thr Asn Thr Asn Leu Arg Thr 
580 585 590 

Asn Ser Gly Asn Leu His He Gin Ala Ala Lys Gly Asn He Gin Leu 
595 600 605 

Arg Asn Thr Lys Leu Asn Ala Ala Lys Ala Leu Glu Thr Thr Ala Leu 
610 615 620 

Gin Gly Asn He Val Ser Asp Gly Leu His Ala Val Ser Ala Asp Gly 
625 630 635 640 

His Val Ser Leu Leu Ala Asn Gly Asn Ala Asp Phe Thr Gly His Asn 
645 650 655 

Thr Leu Thr Ala Lys Ala Asp Val Asn Ala Gly Ser Val Gly Lys Gly 
660 665 670 

Arg Leu Lys Ala Asp Asn Thr Asn He Thr Ser Ser Ser Gly Asp He 
675 680 685 

Thr Leu Val Ala Gly Asn Gly He Gin Leu Gly Asp Gly Lys Gin Arg 
690 695 700 

Asn Ser He Asn Gly Lys His He Ser He Lys Asn Asn Gly Gly Asn 
705 710 715 720 

Ala Asp Leu Lys Asn Leu Asn Val His Ala Lys Ser Gly Ala Leu Asn 



725 



730 



735 
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He His Ser Asp Arg Ala Leu Ser He Glu Asn Thr Lys Leu Glu Ser 

740 745 750 

Thr His Asn Thr His Leu Asn Ala Gin His Glu Arg Val Thr Leu Asn 
755 760 765 

Gin Val Asp Ala Tyr Ala His Arg His Leu Ser He Thr Gly Ser Gin 
770 775 780 

He Trp Gin Asn Asp Lys Leu Pro Ser Ala Asn Lys Leu Val Ala Asn 

785 790 795 800 

Gly Val Leu Ala Leu Asn Ala Arg Tyr Ser Gin He Ala Asp Asn Thr 
805 810 815 

Thr Leu Arg Ala Gly Ala He Asn Leu Thr Ala Gly Thr Ala Leu Val 
820 825 830 

Lys Arg Gly Asn He Asn Trp Ser Thr Val Ser Thr Lys Thr Leu Glu 

835 840 845 

Asp Asn Ala Glu Leu Lys Pro Leu Ala Gly Arg Leu Asn He Glu Ala 
850 855 860 

Gly Ser Gly Thr Leu Thr He Glu Pro Ala Asn Arg He Ser Ala His 
865 870 875 880 

Thr Asp Leu Ser He Lys Thr Gly Gly Lys Leu Leu Leu Ser Ala Lys 

885 890 895 

Gly Gly Asn Ala Gly Ala Pro Ser Ala Gin Val Ser Ser Leu Glu Ala 
900 905 910 

Lys Gly Asn He Arg Leu Val Thr Gly Glu Thr Asp Leu Arg Gly Ser 
915 920 925 

Lys He Thr Ala Gly Lys Asn Leu Val Val Ala Thr Thr Lys Gly Lys 

930 935 940 

Leu Asn He Glu Ala Val Asn Asn Ser Phe Ser Asn Tyr Phe Pro Thr 
945 950 955 960 

Gin Lys Ala Ala Glu Leu Asn Gin Lys Ser Lys Glu Leu Glu Gin Gin 
965 970 975 

He Ala Gin Leu Lys Lys Ser Ser Pro Lys Ser Lys Leu He Pro Thr 

980 985 990 

Leu Gin Glu Glu Arg Asp Arg Leu Ala Phe Tyr He Gin Ala He Asn 
995 1000 1005 

Lys Glu Val Lys Gly Lys Lys Pro Lys Gly Lys Glu Tyr Leu Gin Ala 
1010 1015 1020 

Lys Leu Ser Ala Gin Asn He Asp Leu He Ser Ala Gin Gly He Glu 

1025 1030 1035 104( 

He Ser Gly Ser Asp He Thr Ala Ser Lys Lys Leu Asn Leu His Ala 
1045 1050 1055 

Ala Gly Val Leu Pro Lys Ala Ala Asp Ser Glu Ala Ala Ala He Leu 
1060 1065 1070 

He Asp Gly He Thr Asp Gin Tyr Glu He Gly Lys Pro Thr Tyr Lys 
1075 1080 1085 

Ser His Tyr Asp Lys Ala Ala Leu Asn Lys Pro Ser Arg Leu Thr Gly 
1090 1095 1100 

Arg Thr Gly Val Ser He His Ala Ala Ala Ala Leu Asp Asp Ala Arg 
1105 1110 1115 112( 

He He He Gly Ala Ser Glu He Lys Ala Pro Ser Gly Ser He Asp 



1125 



1130 



1135 



He Lys Ala His Ser Asp He Val Leu Glu Ala Gly Gin Asn Asp Ala 
1140 1145 1150 
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Tyr Thr Phe Leu Lys Thr Lys Gly Lys Ser Gly Lys lie lie Arg Lys 
1155 1160 1165 

Thr Lys Phe Thr Ser Thr Arg Asp His Leu lie Met Pro Ala Pro Val 

1170 1175 1180 

Glu Leu Thr Ala Asn Gly lie Thr Leu Gin Ala Gly Gly Asn lie Glu 
1185 1190 1195 1200 

Ala Asn Thr Thr Arg Phe Asn Ala Pro Ala Gly Lys Val Thr Leu Val 
1205 1210 1215 

Ala Gly Glu Glu Leu Gin Leu Leu Ala Glu Glu Gly lie His Lys His 
1220 1225 1230 

Glu Leu Asp Val Gin Lys Ser Arg Arg Phe lie Gly lie Lys Val Gly 
1235 1240 1245 

Lys Ser Asn Tyr Ser Lys Asn Glu Leu Asn Glu Thr Lys Leu Pro Val 
1250 1255 1260 

Arg Val Val Ala Gin Thr Ala Ala Thr Arg Ser Gly Trp Asp Thr Val 

1265 1270 1275 1280 

Leu Glu Gly Thr Glu Phe Lys Thr Thr Leu Ala Gly Ala Asp lie Gin 
1285 1290 1295 

Ala Gly Val Gly Glu Lys Ala Arg Val Asp Ala Lys lie lie Leu Lys 
1300 1305 1310 

Gly lie Val Asn Arg lie Gin Ser Glu Glu Lys Leu Glu Thr Asn Ser 

1315 1320 1325 

Thr Val Trp Gin Lys Gin Ala Gly Arg Gly Ser Thr lie Glu Thr Leu 
1330 1335 1340 

Lys Leu Pro Ser Phe Glu Ser Pro Thr Pro Pro Lys Leu Ser Ala Pro 
1345 1350 1355 1360 

Gly Gly Tyr lie Val Asp lie Pro Lys Gly Asn Leu Lys Thr Glu lie 

1365 1370 1375 

Glu Lys Leu Ser Lys Gin Pro Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
1380 1385 1390 

Val Ala Lys Asn lie Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Arg 
1395 1400 1405 

Trp Asp Tyr Lys Gin Glu Gly Leu Thr Glu Ala Gly Ala Ala lie lie 

1410 1415 1420 

Ala Leu Ala Val Thr Val Val Thr Ser Gly Ala Gly Thr Gly Ala Val 
1425 1430 1435 1440 

Leu Gly Leu Asn Gly Ala Ala Ala Ala Ala Thr Asp Ala Ala Phe Ala 
1445 1450 1455 

Ser Leu Ala Ser Gin Ala Ser Val Ser Phe lie Asn Asn Lys Gly Asp 

1460 1465 1470 

Val Gly Lys Thr Leu Lys Glu Leu Gly Arg Ser Ser Thr Val Lys Asn 
1475 1480 1485 

Leu Val Val Ala Ala Ala Thr Ala Gly Val Ala Asp Lys He Gly Ala 
1490 1495 1500 

Ser Ala Leu Asn Asn Val Ser Asp Lys Gin Trp He Asn Asn Leu Thr 

1505 1510 1515 1520 

Val Asn Leu Ala Asn Ala Gly Ser Ala Ala Leu He Asn Thr Ala Val 
1525 1530 1535 

Asn Gly Gly Ser Leu Lys Asp Asn Leu Glu Ala Asn He Leu Ala Ala 
1540 1545 1550 

Leu Val Asn Thr Ala His Gly Glu Ala Ala Ser Lys He Lys Gin Leu 
1555 1560 1565 



Asp Gin His Tyr He Val His Lys He Ala His Ala He Ala Gly Cys 



us 7,029,845 B2 



73 



74 



-continued 



1570 



1575 



1580 



Ala Ala Ala Ala Ala Asn Lys Gly Lys Cys Gin Asp Gly Ala lie Gly 
1585 1590 1595 160( 

Ala Ala Val Gly Glu He Val Gly Glu Ala Leu Thr Asn Gly Lys Asn 
1605 1610 1615 

Pro Asp Thr Leu Thr Ala Lys Glu Arg Glu Gin He Leu Ala Tyr Ser 
1620 1625 1630 

Lys Leu Val Ala Gly Thr Val Ser Gly Val Val Gly Gly Asp Val Asn 
1635 1640 1645 

Ala Ala Ala Asn Ala Ala Glu Val Ala Val Lys Asn Asn Gin Leu Ser 
1650 1655 1660 

Asp Lys Glu Gly Arg Glu Phe Asp Asn Glu Met Thr Ala Cys Ala Lys 
1665 1670 1675 168( 

Gin Asn Asn Pro Gin Leu Cys Arg Lys Asn Thr Val Lys Lys Tyr Gin 
1685 1690 1695 

Asn Val Ala Asp Lys Arg Leu Ala Ala Ser He Ala He Cys Thr Asp 
1700 1705 1710 

He Ser Arg Ser Thr Glu Cys Arg Thr He Arg Lys Gin His Leu He 
1715 1720 1725 

Asp Ser Arg Ser Leu His Ser Ser Trp Glu Ala Gly Leu He Gly Lys 
1730 1735 1740 

Asp Asp Glu Trp Tyr Lys Leu Phe Ser Lys Ser Tyr Thr Gin Ala Asp 
1745 1750 1755 176( 

Leu Ala Leu Gin Ser Tyr His Leu Asn Thr Ala Ala Lys Ser Trp Leu 
1765 1770 1775 

Gin Ser Gly Asn Thr Lys Pro Leu Ser Glu Trp Met Ser Asp Gin Gly 
1780 1785 1790 

Tyr Thr Leu He Ser Gly Val Asn Pro Arg Phe He Pro He Pro Arg 
1795 1800 1805 

Gly Phe Val Lys Gin Asn Thr Pro He Thr Asn Val Lys Tyr Pro Glu 
1810 1815 1820 

Gly He Ser Phe Asp Thr Asn Leu Lys Arg His Leu Ala Asn Ala Asp 
1825 1830 1835 184( 

Gly Phe Ser Gin Glu Gin Gly He Lys Gly Ala His Asn Arg Thr Asn 
1845 1850 1855 

Phe Met Ala Glu Leu Asn Ser Arg Gly Gly Arg Val Lys Ser Glu Thr 
1860 1865 1870 

Gin Thr Asp He Glu Gly He Thr Arg He Lys Tyr Glu He Pro Thr 
1875 1880 1885 

Leu Asp Arg Thr Gly Lys Pro Asp Gly Gly Phe Lys Glu He Ser Ser 

1890 1895 1900 

He Lys Thr Val Tyr Asn Pro Lys Lys Phe Ser Asp Asp Lys He Leu 
1905 1910 1915 192( 

Gin Met Ala Gin Asn Ala Ala Ser Gin Gly Tyr Ser Lys Ala Ser Lys 
1925 1930 1935 

He Ala Gin Asn Glu Arg Thr Lys Ser He Ser Glu Arg Lys Asn Val 
1940 1945 1950 

He Gin Phe Ser Glu Thr Phe Asp Gly He Lys Phe Arg Ser Tyr Phe 
1955 1960 1965 

Asp Val Asn Thr Gly Arg He Thr Asn He His Pro Glu 
1970 1975 1980 



(2) INFORMATION FOR SEQ ID NO: 39: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 acides amin,s 

(B) TYPE: acide amin, 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION:!. .143 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Lys Asn Asn lie Phe Leu Asn Leu Asn Lys Lys Ser lie Asn Asn 
15 10 15 

Asn His Phe Val He Ser He Phe Phe Glu Thr He Tyr Gin Phe Glu 
20 25 30 

Thr Lys Asp Thr Leu Leu Glu Cys Phe Lys Asn He Thr Thr Thr Gly 

35 40 45 

His Phe Gly Val He Gly Ala Gin Tyr Glu Lys He Asp Ala Thr Arg 
50 55 60 

Trp He Gly Asp Tyr Glu Glu Val Asn Gly Phe Glu Tyr He Asp Lys 
65 70 75 80 

Ala Pro Ser He Tyr Phe Ser Val Gly Asp Asp Phe Asn Pro Glu Glu 

85 90 95 

Leu He He Pro He Asn Leu Ala Tyr His Tyr Phe Asn He Ala He 
100 105 110 

Ser Asp Phe Leu He Ala His Pro Glu Tyr Gin Lys Lys Cys Lys Glu 
115 120 125 

He Gin Lys Thr Tyr Ser Gin Thr Asn Cys Ser Leu His Glu Thr 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 acides amin,s 

(B) TYPE: acide amin, 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION:!. .833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Val Leu Lys Thr Pro Pro Thr Leu Ala Ala Glu Leu Ser Gly Lys Thr 
15 10 15 

Gly Val Ser He Ser Ala Pro Tyr Ala Asn Glu Asn Ser Arg He Leu 
20 25 30 

Leu Ser Thr Thr Asp He Ser Ser Glu Asn Gly Lys He Lys He Gin 
35 40 45 

Ser Tyr Gly Asp Gin Tyr Tyr Tyr Ala Arg Gin Ser Glu Leu Tyr Thr 

50 55 60 

Phe Glu Arg Arg Ser Tyr Lys Thr Gly Lys Trp Tyr Asn Arg Lys His 
65 70 75 80 

He Thr Glu Val Lys Glu His Lys Asn Ala Lys Pro Asp Ala Val Asn 
85 90 95 



Leu Ser Ala Ser Gin Gly He Asp He Lys Ser Gly Gly Ser He Asp 
100 105 110 
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Ala Tyr Ala Thr Ala Phe Asp Ala Pro Lys Gly Ser lie Asn lie Glu 
115 120 125 

Ala Gly Arg Lys Leu Thr Leu Tyr Ala Val Glu Glu Leu Asn Tyr Asp 
130 135 140 

Lys Leu Asp Ser Gin Lys Arg Arg Arg Phe Leu Gly lie Ser Tyr Ser 
145 150 155 160 

Lys Ala His Asp Thr Thr Thr Gin Val Met Lys Thr Ala Leu Pro Ser 
165 170 175 

Arg Val Val Ala Glu Ser Ala Asn Leu Gin Ser Gly Trp Asp Thr Lys 
180 185 190 

Leu Gin Gly Thr Gin Phe Glu Thr Thr Leu Gly Gly Ala Thr lie Arg 
195 200 205 

Ala Gly Val Gly Glu Gin Ala Arg Ala Asp Ala Lys He He Leu Glu 
210 215 220 

Gly He Lys Ser Ser He His Thr Glu Thr Val Ser Ser Ser Lys Ser 
225 230 235 240 

Thr Leu Trp Gin Lys Gin Ala Gly Arg Gly Ser Asn He Glu Thr Leu 
245 250 255 

Gin Leu Pro Ser Phe Thr Gly Pro Val Ala Pro Val Leu Ser Ala Pro 

260 265 270 

Gly Gly Tyr He Val Asp He Pro Lys Gly Asn Leu Lys Thr Gin He 
275 280 285 

Glu Thr Leu Thr Lys Gin Pro Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
290 295 300 

Val Ala Lys Asn He Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Lys 

305 310 315 320 

Trp Asp Tyr Lys Gin Glu Gly Met Thr Pro Ala Ala Ala Ala Val Val 
325 330 335 

Val He Val Val Thr Val Leu Thr Tyr Gly Ala Leu Ser Ala Pro Ala 
340 345 350 

Ala Ala Gly Thr Ala Gly Ala Ala Gly Ala Gly Ala Gly Gly Ala Ala 

355 360 365 

Ala Gly Thr Ala Ala Gly Thr Gly Val Ala Ala Gly Thr Ala Ala Thr 
370 375 380 

Thr Gly Val Ala Ala Gly Thr Ser Ala Ala Ala He Thr Thr Ala Ala 
385 390 395 400 

Gly Lys Ala Ala Leu Ala Ser Leu Ala Ser Gin Ala Ala Val Ser Leu 

405 410 415 

He Asn Asn Lys Gly Asp He Asn His Thr Leu Lys Glu Leu Gly Lys 
420 425 430 

Ser Ser Thr Val Arg Gin Ala Ala Thr Ala Ala Val Thr Ala Gly Val 
435 440 445 

Leu Gin Gly He Ser Gly Leu Asn Thr Gin Ala Ala Glu Ala Val Ser 

450 455 460 

Lys His Phe His Ser Pro Ala Ala Gly Lys Leu Thr Ala Asn Leu He 
465 470 475 480 

Asn Ser Thr Ala Ala Ala Ser Val His Thr Ala He Asn Gly Gly Ser 
485 490 495 

Leu Lys Asp Asn Leu Gly Asp Ala Ala Leu Gly Ala He Val Ser Thr 

500 505 510 

Val His Gly Glu Val Ala Ser Lys He Lys Phe Asn Leu Ser Glu Asp 



515 



520 



525 
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Tyr lie Ala His Lys lie Ala His Ala Val Ala Gly Cys Ala Ser Ala 
530 535 540 

Val Ala Asn Lys Gly Lys Cys Arg Asp Gly Ala lie Gly Ala Ala Val 

545 550 555 560 

Gly Glu Met Val Gly Glu Thr Leu Leu Asp Gly Arg Asp Val Gly Lys 
565 570 575 

Leu Ser Pro Gin Glu Arg Gin Lys Val lie Ala Tyr Ser Gin lie lie 
580 585 590 

Ala Gly Ser Ala Val Ala Leu Val Lys Gly Asp Val Asn Thr Ala Val 

595 600 605 

Asn Ala Ala Thr Val Ala Val Glu Asn Asn Ser Leu Leu Ala Arg Arg 
610 615 620 

Arg Val Asn lie Arg Trp Thr Pro Arg Gin Glu Leu Glu His Glu Tyr 
625 630 635 640 

Ala lie Leu Glu lie Gin Ala lie Thr Asn Gin lie Arg Arg Leu Asp 

645 650 655 

Pro Lys Phe Asn Gly lie Ala lie Leu Arg Thr Pro Gly Glu Pro Trp 
660 665 670 

Thr Arg His Asp Val Gin Thr Tyr Arg Gin Tyr Tyr Asn Gin Leu Arg 
675 680 685 

Glu Ser Arg Gly Phe Ala Val Glu Pro lie Tyr Arg lie Arg lie Asn 

690 695 700 

Asn Gly Asn Glu Phe Asn Arg lie Met Ser Ser Lys Tyr Pro Tyr Asn 
705 710 715 720 

Glu Leu Tyr Val Ala Asn Pro Lys Ser Ala Thr Gly Tyr Phe Arg Val 
725 730 735 

Asp Ser Tyr Asp Pro Ala Thr Arg Glu lie lie Ser Arg Lys Phe Thr 

740 745 750 

Gin Phe Ser Gin lie Gin Glu Ser Thr Gly lie Gly Tyr lie Lys Glu 
755 760 765 

Ala Val Arg Lys Tyr Ser Pro Gly Thr Val lie Ser Asn Val Pro Ser 
770 775 780 

Thr Pro Thr Thr lie Arg Gly Arg Lys Leu Glu Gly Lys Leu lie Leu 

785 790 795 800 

Glu Val Pro Ala Gin Val Asn Pro lie Pro Gin Ser Val Leu Arg Ala 
805 810 815 

Ala Gin Glu Glu Asn Val lie lie Arg Asp Thr Thr Gly Arg lie Tyr 
820 825 830 

Lys 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 664 acides amin,s 

(B) TYPE: acide amin, 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Val leu Lys Thr Pro Pro Thr Leu Ala Ala Glu leu Ser Gly Lys Thr 
1 5 10 15 

Gly Val Ser lie Ser Ala Pro Tyr Ala Asn Glu Asn Ser Arg lie Leu 
20 25 30 



Leu Ser Thr Thr Asp lie Ser Ser Glu Asn Gly Lys lie Lys lie Gin 
35 40 45 
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Ser Tyr Gly Asp Gin Tyr Tyr Tyr Ala Arg Gin Ser Glu Leu Tyr Thr 

50 55 60 

Phe Glu Arg Arg Ser Tyr Lys Thr Gly Lys Trp Tyr Asn Arg Lys His 
65 70 75 80 

lie Thr Glu Val Lys Glu His Lys Asn Ala Lys pro Asp Ala Val Asn 

85 90 95 

Leu Ser Ala Ser Gin Gly lie Asp lie Lys Ser Gly Gly Ser lie Asp 
100105110 

Ala Tyr Ala Thr Ala Phe Asp Ala Pro Lys Gly Ser lie Asn lie Glu 
115120125 

Ala GlyArg Lys Leu Thr leu Tyr Ala Val Glu Glu Leu Asn Tyr Asp 

130135 140 

Lys leu Asp Ser Gin Lys Arg Arg Arg Phe Leu Gly lie Ser Tyr Ser 
145150155160 



Lys Ala His Asp Thr Thr Thr Gin Val Met Lys Thr Ala Leu Pro Ser 

165 170 175 

Arg Val Val Ala Glu Ser Ala Asn Leu Gin Ser Gly Trp Asp Thr Lys 
180 185 190 

Leu Gin Gly Thr Gin Phe Glu Thr Thr Leu Gly Gly Ala Thr lie Arg 
195 200 205 

Ala Gly Val Gly Glu Gin Ala Arg Ala Asp Ala Lys lie lie Leu Glu 
210 215 220 

Gly lie Lys Ser Ser lie His Thr Glu Thr Val Ser Ser Ser Lys Ser 
225 230 235 240 

Thr Leu Trp Gin Lys Gin Ala Gly Arg Gly Ser Asn lie Glu Thr Leu 
245 250 255 

Gin Leu Pro Ser Phe Thr Gly Pro Val Ala Pro Val Leu Ser Ala Pro 
260 265 270 

Gly Gly Tyr lie Val Asp lie Pro Lys Gly Asn Leu Lys Thr Gin lie 
275 280 285 

Glu Thr Leu Thr Lys Gin Pro Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
290 295 300 

Val Ala Lys Asn lie Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Lys 
305 310 315 320 

Trp Asp Tyr Lys Gin Glu Gly Met Thr Pro Ala Ala Ala Ala Val Val 
325 330 335 

Val He Val Val Thr Val Leu Thr Tyr Gly Ala Leu Ser Ala Pro Ala 
340 345 350 

Ala Ala Gly Thr Ala Gly Ala Ala Gly Ala Gly Ala Gly Gly Ala Ala 
355 360 365 

Ala Gly Thr Ala Ala Gly Thr Gly Val Ala Ala Gly Thr Ala Ala Thr 
370 375 380 

Thr Gly Val Ala Ala Gly Thr Ser Ala Ala Ala He Thr Thr Ala Ala 
385 390 395 400 

Gly Lys Ala Ala Leu Ala Ser Leu Ala Ser Gin Ala Ala Val Ser Leu 

405 410 415 

He Asn Asn Lys Gly Asp He Asn His Thr Leu Lys Glu Leu Gly Lys 
420 425 430 

Ser Ser Thr Val Arg Gin Ala Ala Thr Ala Ala Val Thr Ala Gly Val 
435 440 445 



Leu Gin Gly He Ser Gly Leu Asn Thr Gin Ala Ala Glu Ala Val Ser 
450 455 460 
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Lys His Phe His Ser Pro Ala Ala Gly Lys Leu Thr Ala Asn Leu lie 
465 470 475 480 

Asn Ser Thr Ala Ala Ala Ser Val His Thr Ala lie Asn Gly Gly Ser 
485 490 495 

Leu Lys Asp Asn Leu Gly Asp Ala Ala Leu Gly Ala lie Val Ser Thr 
500 505 510 

Val His Gly Glu Val Ala Ser Lys lie Lys Phe Asn Leu Ser Glu Asp 

515 520 525 

Tyr He Ala His Lys He Ala His Ala Val Ala Gly Cys Ala Ser Ala 
530 535 540 

Val Ala Asn Lys Gly Lys Cys Arg Asp Gly Ala He Gly Ala Ala Val 
545 550 555 560 

Gly Glu Met Val Gly Glu Thr Leu Leu Asp Gly Arg Asp Val Gly Lys 

565 570 575 

Leu Ser Pro Gin Glu Arg Gin Lys Val He Ala Tyr Ser Gin He He 
580 585 590 

Ala Gly Ser Ala Val Ala Leu Val Lys Gly Asp Val Asn Thr Ala Val 
595 600 605 

Asn Ala Ala Thr Val Ala Val Glu Asn Asn Ser Leu Leu Ala Arg Arg 

610 615 620 

Arg Val Asn He Arg Trp Thr Pro Arg Gin Glu Leu Glu His Glu Tyr 
625 630 635 640 

Ala He Leu Glu He Gin Ala He Thr Asn Gin He Arg Arg Leu Asp 
645 650 655 

Pro Lys Phe Asn Gly He Ala He Leu Arg Thr Pro Gly Glu Pro Trp 

660 665 670 

Thr Arg His Asp Val Gin Thr Tyr Arg Gin Tyr Tyr Asn Gin Leu Arg 
675 680 685 

Glu Ser Arg Gly Phe Ala Val Glu Pro He Tyr Arg He Arg He Asn 
690 695 700 

Asn Gly Asn Glu Phe Asn Arg He Met Ser Ser Lys Tyr Pro Tyr Asn 

705 710 715 720 

Glu Leu Tyr Val Ala Asn Pro Lys Ser Ala Thr Gly Tyr Phe Arg Val 
725 730 735 

Asp Ser Tyr Asp Pro Ala Thr Arg Glu He He Ser Arg Lys Phe Thr 
740 745 750 

Gin Phe Ser Gin He Gin Glu Ser Thr Gly He Gly Tyr He Lys Glu 

755 760 765 

Ala Val Arg Lys Tyr Ser Pro Gly Thr Val He Ser Asn Val Pro Ser 
770 775 780 

Thr Pro Thr Thr He Arg Gly Arg Lys Leu Glu Gly Lys Leu He Leu 
785 790 795 800 

Glu Val Pro Ala Gin Val Asn Pro He Pro Gin Ser Val Leu Arg Ala 

805 810 815 

Ala Gin Glu Glu Asn Val He He Arg Asp Thr Thr Gly Arg He Tyr 
820 825 830 

Lys 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 acides amin,s 

(B) TYPE: acide amin, 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 .. 162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Lys Asp lie Phe Tyr Cys Glu Gin Trp Ser Tyr Gly Tyr Lys 
15 10 15 

Arg Leu His Lys Pro Phe Ser Glu Lys Gin Ala Glu Glu Lys His Leu 
20 25 30 

Lys Gly Glu Leu Tyr Thr Ala Val lie Gly Ser Ala Thr Gin Pro Glu 
35 40 45 

Tyr Val He Thr Leu Arg Glu Glu Val Gly Phe Phe Ser Val Asn Phe 
50 55 60 

Phe Asp Lys Phe Gly Arg Asp Tyr Leu Thr His Gin Phe Gin Lys Tyr 
65 70 75 80 

Ser Asn Ser Asn Tyr Tyr Phe Leu Ser Met Ala Val Trp Arg Asp Tyr 
85 90 95 

He Thr Leu Glu Ser His Asp Leu Ala Glu Gly Tyr Thr Tyr Phe Phe 

100 105 110 

Asn Glu Asn Thr Asp Asp Cys Tyr Val Leu Lys Gin Asp Phe He Asn 
115 120 125 

Asn Glu Arg Tyr Glu Lys Thr Glu Leu Tyr Ser Gin Lys Asp Lys Val 
130 135 140 

He Leu Phe Pro Lys Phe Gly Glu Tyr Asp Leu Val Leu Asn Pro Asp 
145 150 155 160 

He He 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 acides amin,s 

(B) TYPE: acide amin, 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Asn Lys Arg Met Lys Met Cys Pro Ala Cys Gin Gin Gly Tyr Leu 
15 10 15 

Tyr His Ser Lys Pro Lys Tyr Leu His Asp Glu He He Leu Cys Asp 
20 25 30 

Glu Cys Asp Ala Val Trp Leu Lys Gly Met Asn He Phe Tyr Gly Glu 
35 40 45 

Tyr Glu Lys Asp Phe Tyr Ser Tyr Val Pro Phe Met Glu Ser Gin Gly 
50 55 60 

He Thr Ser Glu Cys He Trp Glu Gly Asp Leu Phe Asp His Pro Tyr 
65 70 75 80 

Tyr Glu Asp Glu Asn Ser Asn Asp Met Asp 
85 90 



(2) INFORMATION FOR SEQ ID NO: 44: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 acides amin,s 

(B) TYPE: acide amin, 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Ser Ala Thr Glu lie Glu Lys Ala Lys Ala Lys lie Thr Ala Tyr 
15 10 15 

Ser Lys Leu Val Ala Gly Thr Ala Ser Ala Val Val Gly Gly Asp Val 
20 25 30 

Asn Thr Ala Ala Asn Ala Ala Gin lie Ala Val Glu Asn Asn Thr Leu 

35 40 45 

Tyr Pro Arg Cys Val Gly Ala Lys Cys Asp Glu Phe Gin Lys Glu Gin 
50 55 60 

Gin Lys Trp lie Arg Glu Asn Pro Glu Glu Tyr Arg Glu Val Leu Leu 
65 70 75 80 

Phe Gin Thr Gly Phe He Pro He He Gly Asp He Gin Ser Phe Val 

85 90 95 

Gin Ala Gin Thr Ala Ala Asp His Leu Phe Ala Leu Leu Gly Val Val 
100 105 110 

Pro Gly He Gly Glu Ser He Gin Ala Tyr Lys Val Ala Lys Ala Ala 
115 120 125 

Lys Asn Leu Gin Gly Met Lys Lys Ala Leu Asp Lys Ala Ala Thr Val 

130 135 140 

Ala Thr Ala Gin Gly Tyr Val Ser Lys Thr Lys He Lys He Gly Gin 
145 150 155 160 

Thr Glu Leu Arg Val Thr Ala Ala Thr Asp Lys Gin Leu Leu Lys Ala 
165 170 175 

He Gly Glu Gly Arg Asp Thr Thr Gly Lys Met Thr Glu Gin Leu Phe 

180 185 190 

Asp Ser Leu Ala Lys Gin Asn Gly Phe Arg Val Leu Ser Gly Gly Lys 
195 200 205 

Tyr Gly Gly Asn Asn Gly Phe Asp His Val Trp Gin Ala Ala Asp Gly 
210 215 220 

Ser Val Val Leu He Val Glu Ser Lys Gin He Arg Asn Gly Thr Val 

225 230 235 240 

Gin Leu Asn Pro Asn Gly Ala Gly Gly Tyr Thr Gin Met Ser Glu Asp 
245 250 255 

Trp He Arg Gin Val Leu Asp Gin Leu Pro Asp Gly Ser Pro Ala Lys 
260 265 270 

Ala Ala Val Phe Lys Ala Asn Lys Asn Gly Thr Leu Lys Thr Ala He 

275 280 285 

Ala Gly Val Asp Arg Gin Thr Gly Lys Ala Val He Leu Pro Val Lys 
290 295 300 

Val Pro Ser Lys Thr Asn He Arg Arg 
305 310 



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 311 acides amin,s 

(B) TYPE: acide amin, 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Gly His Asn Met Met Thr Thr Gin Lys Trp Tyr Glu His lie Thr 
15 10 15 

Asn Val lie lie Gly Asn Thr Ala Asn Phe Asn Ser Gly Cys Leu Asp 
20 25 30 

Ser lie Asp Tyr Val Asp Glu Arg Lys Gly Val Pro Leu Ala Ala Met 
35 40 45 

Gin His He Phe Met Asp Val Arg Ala Ala Ala Ser His Ala Tyr Leu 
50 55 60 

Phe Glu His Asp Leu Lys Lys Phe Lys Gin Tyr Ala Tyr Val Ala Gly 
65 70 75 80 

Lys Leu Gly Val Leu Leu Ser Val Asn Ser Thr Asp Pro Glu Pro Phe 

85 90 95 

Phe Phe Pro Cys Asp Met Leu Asn He Gin Asn Pro Met Phe Leu Met 
100 105 110 

Leu Met Ser Asp Ser Pro Gin Leu Arg Glu Phe Leu Val Arg Asn He 
115 120 125 

Asp Asn He Ala Asn Asp Thr Glu Ala Phe He Asn Arg Tyr Asp Leu 

130 135 140 

Asn Arg His Met He Tyr Asn Thr Leu Leu Met Val Glu Gly Lys Gin 
145 150 155 160 

Leu Asp Arg Leu Lys Gin Arg Ser Glu Lys Val Leu Ala His Pro Thr 
165 170 175 

Pro Ser Lys Trp Leu Gin Lys Arg Leu Tyr Asp Tyr Arg Phe Phe Leu 

180 185 190 

Ala Phe Ala Glu Gin Asp Ala Glu Ala Met Lys Ala Ala Leu Glu Pro 
195 200 205 

Leu Phe Asp Lys Lys Thr Ala Arg Met Ala Ala Lys Glu Thr Leu Ser 
210 215 220 

Tyr Phe Asp Phe Tyr Leu Gin Pro Gin He Val Thr Tyr Ala Lys He 

225 230 235 240 

Ala Ser Met His Gly Phe Asp Leu Gly He Asp Gin Glu He Ser Pro 
245 250 255 

Arg Asp Leu He Val Tyr Asp Pro Leu Pro Ala Asp Glu Tyr Gin Asp 
260 265 270 

He Phe Asp Phe Met Lys Gin Tyr Asp Leu Ser Tyr Pro Tyr Glu Tyr 

275 280 285 

Leu Gin Asp Trp He Asp Tyr Tyr Thr Phe Lys Thr Asp Lys Leu Val 
290 295 300 

Phe Gly Asn Ala Lys Arg Glu 
305 310 



(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GCCACCGGTA CGGAAACTGA A 21 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 47: 
CCTGAATTCA TGTCTATTCC ATTTTGAAGA 30 

(2) INFORMATION FOR SEQ ID NO : 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CCGAGATCTT TAACCCTTTG GGCTTAAGCG A 31 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GGGAGATCTC CCGCTCGTGT TGTGCATTA 29 

(2) INFORMATION FOR SEQ ID NO : 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AAGAGATCTG CAGCCAAGGC TCTCGAAA 28 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

( B ) TYPE : nuc leotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GGGAGATCTC AGGCTGCCGC CGTTGA 26 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
GGGAGATCTC ACCCCAAGAA CGCCAAAA 28 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGGAGATCTG AACGTATAGT AATCTATCCA A 31 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
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AGTGGCTCCT AG 12 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
AGCACTCTCC AGCCTCTCAC CGAG 24 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
AGTGGCTCTT AA 12 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
AGTGGCTGGC 10 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
AGCACTCTCC AGCCTCTCAC CGAC 24 
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(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GTACTTGCCT AG 12 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
ACCGACGTCG ACTATCCATG AACG 24 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GTACTTGCTT AA 12 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GTACTTGGGC 10 



(2) INFORMATION FOR SEQ ID NO: 63: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
ACCGACGTCG ACTATCCATG AACC 24 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 64: 
AATTCTCCCT CG 12 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AGGCAACTGT GCTATCCGAG GGAG 24 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GATCAACTTT TCCCTGTTTG TCCCATTACC GGTTTGAATG AACCGATTGC GCGCCGCGCG 60 
TGTTGTTGGA CATTACCTGC GATTCAGACG GTACGATTGA CCACTACATC GAGGAGAACG 120 
GCAATCAGGG TACAATGCTA 140 



(2) INFORMATION FOR SEQ ID NO: 67: 



us 7,029,845 B2 
101 102 



-continued 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GATCCGCGTA CTTGGTTTTT CATATTTTGC ATAGTCTTGT CGGTCGGGCA TCTTCCCCGA 6 0 

CATCATCTAA ATTTGTCTTT ATTGGTTTTT ACGCCACTCA TTGCGGATAA ACAATATTCC 12 0 
GCCTTGCCGT CGCGAATGTT CAAGCTAGCC TGCATCACCG TAATCAGGTT GCCCGTTACC 180 
GAGCCTTCGA GA 192 

(2) INFORMATION FOR SEQ ID NO: 68: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GATCCGGCTG CCCGACGCGC GCAAAATTGC CGCCGAGGAA AGCGCGCACA ACCACGACGG 6 0 

CAAAACCAGC GTATGGCAAT ACAAACATCT CGTGTTCGGT ACGGCAGGCA TTTTCTGCTA 12 0 
TGTCGGCGCG GAGGTGTCTA TCGGTTCGTT GATGGTCAAC GTATTGGGTT ATCTGAAAGG 180 
GCTGGATC 188 

(2) INFORMATION FOR SEQ ID NO : 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GATCCCCCAC TTTACCTCGG GCAGATTTTG CGCGTTCATT ACAATAGCGT ATTTATGCGT 6 0 

TTGCGTTTGC GCTTGCCGCT GCCCCCCCCC CGCCGGTATG GGAAAACATC AATATGGCGG 12 0 

TATAAAGCGC GGTATGGCGG AAAACCTGCC GTTTCCAAGT TTTATTCATC TTTTATTCCT 18 0 

TGAGTTTGCC TTCACGGGAC GGGGCGGCGC GCGGAACGCG GGGTTCGGTA AACCGCCCGA 240 

TTCCGCGCCC GCCGAATTGC TGATTGAAAA GCTTACTTCC CCATTTTAAC TTTGCACACT 300 
GATC 304 



(2) INFORMATION FOR SEQ ID NO: 70: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GATCAGACCC ATTTTCAGCG CACCGTAAGC GCGGATTTTC TCGAATTTTT CCAAAGCTGC 6 0 

GGCATCGTTG TTGATGTCGT CTTGCAACTC TTTGCCCGTG TAGCCCAAGT CGGCGGCATT 12 0 
CAGGAAAACG GTCGGAATGC CCGCGTTGAT GAGCGTGGCT TTCAAACGGC CTATATTCGG 180 
CACATCAATT TCATCGACCA AATTGCCGGT TGGGAACATA CTGCCTTCGC CGTCGGCTGG 240 
ATC 243 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 36 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CGGCGGCGTA GTCCGCCGCG ACAGCGTTAC CATAAGCGGG ACAGACTACA CCCCTTTATC 6 0 

TAACCCGCAA AGTTTGGATA CGGAATTAAA ATGGTTGCTT CAAGAAGCTC CCGAAATAGA 120 
AAATCCTTTC GACCGCGCCG TTTATCTCCA TAATAATTTG GCGTATCTTC AATATTTTAA 180 
AGATTGCAAT AAACGTACTG CCAGAAACTG CATGACCTTG TCGCTGATGC GCTCCG 236 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CGGTCAATCA CAAGAAAGTC AGCCGTCTGA TGGCGAAGAC GGGGCTGAAG GCAGTGATAT 6 0 

GGCGGCGCAA ATACCGCTCG TTCAAAGGAG AAGTCGGCAA AATTGCGCCG AATATCCTGC 12 0 
GACGCTGTTT CCATGCAGAA AAGCCGAATG AGAAATGGGT AACGGACGTT GCCGAGTTCA 180 
ATGTAGGCGG AGAAAAGATA TACCTTTCTC CGATTATGGA TTTGTTTAAC GGGGAAATCG 240 
TCAGTTACCG TATTCAGACC CGCCCGACTT TCGATTTGGC 280 



(2) INFORMATION FOR SEQ ID NO: 73: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGGTCAGAAA CAGGCAAGGT AATGAAAATG CCTGAGGCAC GGACTGTGCT GCGAACGAAA 6 0 

ACTCCTTACC GAAGTCTTCT ATACCCAGGC TCAATAGCCG CTCAAGGAGA GAGCTATCAT 12 0 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
CGGTCAGAAA CAGGCAAGGT AATGAAAATG CCTGAGGCAC GGACTGTGCT GCGAACGAAA 60 
ACTCCTTACC GAAGTCTTCT ATACCCAGGC TCAATAGCCG CTCAAGGAGA GAGCTATCAT 120 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CGGTGTTTTT CTTAACAATT CGCCGACTTC ATGGCGATAT TTAAGTGACA GTTGCTCCGC 60 
CCACGCAGTT GCGCCGAACT CAGCACCACG ACATTATACT GATTATGCAC ATCGGCAAGA 120 
TCAAACTGAC CTATCGTAGT ATCGCAGACT GT 152 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
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CGGGAGGTTT TGTGCATCCT GATACCGATC GGTTGTTGTT GCTCAAAGGA CAGAAGGCCG 60 

CTGATAAACG AGATTACCTG TTTGTCGCTA TTGACGATTT TTATACTCTG CCATTTTGCC 120 

AGACAAAACC GCAGACAGTG CTGCCAAGTT TCTGACCGAA CATCTGGCCG ACCCCTGCTT 180 

GTACCTGATT GAGTACGCTT ACTCTGACAA TGATAGGTAA TATAAAGAGC CGTCCAACAT 240 

GCTTTCGGTG CAGTTTGTTA TGATAATGGG ATTGGTTGGA GGCTTGCCCG ATTTGCTTGT 300 

CCGCAGACCA ACGGTAAGGC GGAGCGGGTT ATCCGTACCT TGATGGAGAT GTGGCATGAG 360 

GAACAGTCGT TTGACAGACC G 381 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CGGAGCATAA AATCGTTATT AAAGATAATG GTATAGGAAC GAGCTTCGAT GAAATCAATG 60 
ATTTTTATTT GAGAATCGGT CGGAACAGAA GGGAAGAAAA ACAAGCCTCC CCGTGCGGAA 120 
GAATTCCAAC GGGTAAAAAA GGCCTTGGTA AATTGGCATT ATTCGGGCTT GGCAACAAAA 180 
TTGAAATTTC TACTATCCAG GGAAACGAAA GGGTTACTTT TACTTTGGAT TATGCAGAGA 240 
TTCGAAGAAG CAAGGGTATT TATCAACCG 269 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CGGATGAAAA CGGCATACGC GCCAAAGTAT TTACGAACAT CAAAGGCTTG AAGATACCGC 60 
ACACCTACAT AGAAACGGAC GCGAAAAAGC TGCCGAAATC GACAGATGAG CAGCTTTCGG 120 
CGCATGATAT GTACGAATGG ATAAAGAAGC CCGAAAATAT CGGGTCTATT GTCATTGTAG 180 
ATGAAGCTCA AGACGTATGG CCG 203 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANT I- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



CGGTTTCAGG TTGTCGCGAA GGCTCGGTAA CGGGCAACCT GATTACGGGT GATGCAGGCA 



60 



GCTTGAACAT TCGCGACGGC AAGGCGGAAT ATGTTTATCC GCAATGAGTG GCGTAAAAAC 



120 



CAATAAAGAC AAATTTAGAT GATGTCGGGG AAGATGCCCG ACCGACAAGA CTATGCAAAA 180 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 07 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG TTTGGAAATA 60 
TTGTGTATCG GGGGGGGGTA TTTGCTGACG TAAAAAACTA TAAACGCCGC GCAAAATATG 120 
GCTGACTATA TTATTGACTT TGATTTTGTC CTGCGCGGTG ATGGATAAAA TCGCCAGCGA 180 
TAAAGAATTT GCGAGAACCT GATGCCG 207 

(2) INFORMATION FOR SEQ ID NO: 81 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
CGGCAACGAT TTGAGCTATC GCGGTTACGA CATTCTGGAT TTGGCACAAA AATGCGAGTT 60 
TGAAGAAGTC GCCCACCTGC TGATTCACGG CCATCTGCCC AACAAATTCG AGCTGGCCGC 120 
TTATAAAACC AAGCTCAAAT CCATGCGCGG CCTGCCTATC CGTGTGATTA AAGTTTTGGA 180 
AAGCCTGCCT GCACATACCC ATCCGATGGA CGTAATGCGT ACCG 224 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



TATGAAAAAC CAAGTACGCG GATCAGGCAT GGATGCACGA TCCAATCCG 



229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
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CGGGAACAGC CATTGCCCAC GCCCACGCCC CCCAAGAAAG ACGGAAACTA CTGCCTAAAT 



60 



TTTCGGCAAT CAAGTTGACG ATTAAAGGGT TGGGGGCAGT TGCAGTAATA AACATAGCCG 120 
ACGAAATGGG ATTGGAATGA TAGTTGACCA AAGCCAAATA TTTACCCATC TTGCCTTCTG 180 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CGGGAATTCT GAGCAGAATG AAAGAAAGCA GGCTTGATAA TTTCATAAAG TTATTGGAAG 6 0 

AAAAAGGATT TACCGTCCAT TTCGGTATTC ACAATACGGC TGATTACGGA ATTCCCCAAA 120 
GCCGTAAAAG ATTTACGTTA ATTGCAAACA GAATAACCAA AGAAAAGCTG GAACCAGTCA 180 
AGTATTCGGG CAAACGGCTT ACGGTAGCCG ATGTTTTGGG AATGGAAATG GCTTTCCCAA 240 
CATTATTGCA GGACACCAAG ACGAAACGGA TTTTATGCAT AGCTGTGCGG GAATTATCTG 300 
ATATCACTTG AACGATTGGC TTGATACCTA AAAACGGAGG AACCGTTGGC TTT 353 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
AATTCCGTAT CCAAACTTTG CGGGTTAGAT AAAGGGGTGT AGTCTGTCCC GCTTATGGTA 60 
ACGCTGTCGC GGCGGACTAC GCCCGGAGCC TTTTTCCAGT AAGTTTTCGG AAATCAGGCT 120 
GTGGGTGGTT TTTAAGAAAT CCAACCAGTC AAACGGCTCG GGGCTGTCCA AACCGGACAC 180 
AGGTGCCGGT AACTTTCCCT CAGGTTGATT AACATTACGG CATCCGAATA TAACTTCCCG 240 
CCTGCGGTTT GCCCGAGTTT AAGCAATGCC TGCGTATCGT ATTGATTATA AAGTGTTTCC 300 
TTCCAATT 308 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



TGCCTTTTGC GGGATTGGAG CCGTAACTGC CG 



212 



(iii) HYPOTHETICAL: NO 
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(iv) ANT I- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



AATTCGTGTG CCGCGTCGAC AAACCGCTGA CGTAGCGGAT GTCTCATGCC ACGTTTCAAA 



60 



GCAGGTTGAT GGCGGTTAGC AACCCTCTGA TTTCACTGGG ATAT 



104 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
AATTGCGTAG AGTGGGCTTC AGCCACGTTT TTTCTTTTTC GGTCGTTGAT TGGTGGGCTG 6 0 

AACCACTTGT TTCGGAAATC CGTATCATG 89 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

AATTTCCACC TATGCCCTAC GCAGCGATTA TCCGTGGTTT ACCCAAAGGG TGATTATGGC 6 0 

AAAAGCGCGG GGTTGAGCGA CCGCCTTTTG TTGCCGGCGT TCAAACGGGT TTTGATAGGA 12 0 

AATGCAGGCA CGAAGCCTCG GCTGATTGTG ATGCACCTGA TGGGTTCGCA CAGTGATTTT 18 0 

TGCACACGTT TGGATAAGGA TGCGCGGCGG TTTCAGTATC AAACTGAAAA AATATCCTGC 240 

TATGTTTCCA TCAATCGCGC J^CCGATAA ATT 273 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
AATTCTTCCG CACGGGGAGG CTTGTTTTTC TTCCCTTCTG TTCCGACCGA TTCTCAAATA 6 0 

AAAATCATTG ATTTCATCGA AGTTCATTCC TATACCATTA TCTTTAATAA CGATTTTATG 12 0 



CTCCGGTTTA TCGAATAACC TAACTTCCAC TTCCGTAGCA CATGCATCGT AGGCATTCGC 



180 
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TATCAACTCG GCAATCGCAG GAACAGTGTG CGAATACAAT CTTTACACCC AAATGTTCGA 240 
TTACGGTTGG CTCGAAACTC AATTTCAATT 270 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
AATTATGAAC ACACGCATCA TCGTTTCGGC TGCGTTCGTT GCGTTGGCAT TAGCAGGTTG 
CGGCTCAATC AATAATGTAA CCGTTTCCGA CCAGAAACTT CAGGAACGTG CCGCGTTTGC 
CTTGGGCGTC ACCAATGCCG TAAAAATCAG CAACCGCAGC AATGAAGGCA TACGCATCAA 
CTTTACCGCA ACTGTGGGTA AGCGCGTGAC CAATGCTATG TTACCAGTGT AATCAGCACA 
ATCGGCGTTA CCACTTCCGA TGCAATT 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
AATTTTTATT TGGTTCGTAG TCATTTTGTG CAACTGAACG ATATTCGTTT TCATCATTGC 6 0 

TAACGTCTAG TGCCCATTGT GGCCCGTAAT AAGAGATTTC GTCTCCTTTT ACATGTTTGA 12 0 
CGCTGACGGC ATACTGGGGA TCGATGACGG ATAATGTACG TCTGTTGACA TCTGCAACGC 180 
TAAATCAATC ATCGGTATTG GATAATGCGT TGCCGATGTT TTGACTTGTA TGTT 234 

(2) INFORMATION FOR SEQ ID NO : 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
AATTCGGCCG GCTGTGTCAA ATAATGCGTT ACTTTGGCCG GGTCTTGTTC TTTGTAAGTG 6 0 

GTGGTCTTTT TTTGCGCGTT ATCCCCATCT GTTTGAGTGC ATAGCAAATG GTGGCTGCCG 12 0 



60 
120 
180 
240 
267 



TACAATCAAA TGTTTGGCGT TCATGCAGAT AGGCATCATG GTGTTGCCCA ATATATTGAG 
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CCGGTTTTTG CCTATCCGAT TTGACGGCAT TTAGACCGGT AACTTGATGT TTTAAGCTGC 240 
CTGTTTGTTT AAAGGCGAAT CCACAAGTAA AGCGTGTTTC TTGACAGGTT AAACG 295 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
AATTGTGTAT ATCAAGTAGG ATGGGCATTT ATGCCTGACC TACAAAACCA AAAACAACCT 6 0 

ACCACCCTTA ATCAACTCCA CAAACCCTCT TCAGACAACC TCGTTTTTTG AAAAACAATC 12 0 

TGTAAACAGA TAACTGCTGA AGAATACCGT TGCCGAGCCC CAAAACCCGT ACTGCAACTT 18 0 
TTATTGTGAA CTTCCCATTA TGAGAAAATC CCTTTTCGTC CTCTTTCTGT ATTCGTCCCT 240 
ACTTACTGCC AGCGAAATT 259 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
AATTGCACCA CGCGATGATG GGTACGCCTC TGTTGCCATT GCGACCGCCG CCGCCGTGCC 6 0 

CGGTACGCTG GTCAACCTTG CCGCGGCGGA ACGGGTAAAG AAGTGCGCTT CGGGCATCCT 12 0 
TCCGGTACAT TGCGCGTCGG TGCAGCGCCG AATGTCAGGA CGGACAATGG ACGGCCACCA 180 
AAGCGGTTAT GAGCCGCAGC GCACGCGTGA TGATGGAAGG TTGGGTCAGG GTGCCGGAAG 240 
ATTGTTTTTA AATTGGACGG CGAACCGGTC TATTCGTATT GGCGTTATAC CGCCGCAAAG 300 
GCAGACCTTG AAACTGGTGC GTGCCGTGCA GGGCATGTAC GGCTATGTGT GCGTGGCGGG 360 
CGGATTTGAT GTGCGGAAT 379 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
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AATTTGTTGG GCAGATGGCC GTGAATCAGC AGGTGGGCGA CTTCTTCAAA CTCGCATTTT 



60 



TGTGCCAAAT CCAGAATGTC GTAACCGCGA TACGTCAAAT CGTTGCCGGT ACGCAACGGT 120 

ACACAAAGCG GTATTACCGG CCGCAACGCC AGAAAGCGCA ACGGATTTTT AGGTTTGAGG 180 

GTCGGGGTTT GAGTAGTTTC AGTCATGGTA TTTCTCCTTT GTGTTTTTAT GGGTTTCGGG 240 

TTTTCAGACG ACCGATGCGG ATTTGTTGAA AGGCAGTCTG AAAGCGGTAA ATCATTTTTG 300 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 86 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
AATTCGGAGG AGCAGTACCG CCAAGCGTTG CTCGCCTATT CCGGCGGTGA TAAAACAGAC 60 
GAGGGTATCC GCCTGATGCA ACAGAGCGAT TACGGCAACT TGTCCTACCA CATCCGTAAT 120 
AAAAACATGC TTTTCATTTT TTCGGCAAGC AATGACGCAC AAGCTCAGCC CAACACAACT 180 
GACCCTATTG CCATTTTATG AAAAAGACGC TCAAAAAGGC ATTATCACAG TTGCAGGCGT 240 
AGACCGCAGT GGAGAAAAGT TCAATGGCTC CAACCATTGC GGAATT 286 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
AATTTGGATA CGTTGGAAAA GGGATATTTG ATTGGGAATG GGATGAAGAT AAGCGTAGAT 60 
GAGTTGGGGA AAAAAGTGTT AGAACATATC GGTAAGAATG AACCGTTATT GTTGAAAAAT 120 
CTACTGGTTA ACTTCAATCA GGGAAAACAT GAAGAAGTTA GGAAGTTGAT TTATCAGTTG 180 
ATAGAGTTAG ATTTTCTGGA ACTTTTGTGA GGGATTCTAT GAAAAACTGG AAGCAATT 238 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



AAACAATT 



308 



(iv) ANT I- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

AATTCGGCAC GCAGGTTTTC TAAAAAAAGG CCGTTGATGA CTTTGTCGAT ATTGGCGGCT 60 

TCGGTGTAGT GCGCGCCCGC TTCGGCCGCT CTTGCGCGTC CATGACGGAT TGGAAGAGCG 120 

TGCCGAAGAT TTCTGGACTG ATGTTGCGCC AGTCGAAATT GCCGACACGG GAGGAATACC 180 

TGCCAACAAG AGTGCAGGCA GCGTAATCAA ACCACCCCCA CCCGCAATCG CATCGATAAA 240 

TCCGGCAATC ATCGCAACCA AACCCAAAGC GAGTATTATG TATAAATCTT CCATGTTTCT 300 

TAATCCTGTT AACTTGCACC AA 322 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



CCCACATTTT GGAAGC 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
316 



(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
AATTTGTCGG CAATCTTCCC GGGTCGCTTT ATTTTGTGCA GGCATTATTT TTCATTTTTG 
GCTTGACAGT TTGGAGATAT TGTGTATCGG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 
AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC CTGCGCGGTG 

ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG ATGCCGGCCT GTTGTTGAAT 
ATTTTCGACC TGTAATTACG ATTTGGCTTC CGCGCCGGCA CAATATGCCG CCAAGCGGCG 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
AATTCGGACA GTATGAATAC AGCGGATTAA TACAAGGTAA GTTCATTACA ACGGAAAAAC 60 
CTTTAAAGAA TAATATGAAA GGTATTACCT TGTTTGCCAA CGGGAATGGT AAATATGCCC 120 
GAGTTTTTCA CTGAATAGCG AATCCAGCCA TTTCTATTCA TATTTGACTG GATGGCTGAA 180 
TGTGGACTTT ATAGATAATG ACGATGAAGA TTTAATT 217 



What is claimed is: 

1. An isolated DNA which is specific to Neisseria men- 
ingitidis (Nm) and Neisseria gonorrheae (Ng), and hybrid- 
izes on a Southern blot to SEQ ID NO: 95 and does not 
hybridize on a Southern blot to a DNA sequence of Neisseria 
lactamica (Nl) strain N18064, under the following hybrid- 
ization conditions: 18 h at 65° C, with a solution comprising 
0.5 M NaP04 pH 7.2, 0.001 M EDTA-Na, 1% bovine serum 



albumin and 7% sodium dodocyl sulphate, followed by at 
least two washes in a solution comprising 40 mM Na PO4 
pH 7.2, 1 mM EDTA, and 1% SDST, the final wash being 
conducted at 85° C. for 5 minutes, or the complement of said 
isolated DNA which is specific to Neisseria meningitidis 
65 (Nm) and Neisseria gonorrheae (Ng), 

provided that said DNA or the complement of said 
isolated DNA is not pilC, or a gene involved in the 



us 7,029^ 

123 

biosynthesis of any one of the polysaccharide capsule, 
IgA proteases, pilin, a protein which binds transferin, a 
protein which binds lactoferin, and an opacity protein 
said DNA being within an islet involved in the poloniza- 
tion of the nasopharynx or invasion of the submucousal ^ 
space or systemic dissemination of Nm. 

2. A composition comprising the DNA or complement of 
claim 1 and a carrier. 

3. An isolated DNA which is specific to Neisseria mem- 
ingitidis (Nm) and Neisseria gonorrheae (Ng), and hybrid- 
izes on a Southern blot to SEQ ID NO: 95 and does not 
hybridize on a Southern blot to a DNA sequence of Neisseria 
lactamica (Nl) strain N18064, under the following hybrid- 
ization conditions: 16 h at 65° C, with a solution comprising 
0.5 M NaP04, pH 7.2, M EDTA-Na, 1% bovine serum 15 
albumin and 7% sodium dodecylsulphate, followed by at 
least two washes in a solution comprising 40 mM Na PO4 
pH7.2, 1 mM EDTA and 1% SDS, the final wash being 
conducted at 65° C. for 5 minutes, or the complement of said 
isolated DNA which is specific to Neisseria meningitidis 20 
(Nm) and Neisseria gonorrheae (Ng), 

provided that said DNA or the complement of said 
isolated DNA is not pilC, or a gene involved in the 
biosynthesis of any one of the polysaccharide capsule, 
IgA proteases, pilin, a protein which binds transferrin, 
a protein which binds lactoferin, and an opacity protein. 

4. A composition comprising the DNA or complement of 
claim 3 and a carrier. 

5. A method of detecting Neisseria meningitidis (Nm) or 
Neisseria gonorrheae (Ng) in a sample from a patient said 
method comprising providing isolated sample DNA of said 
sample from the patient, contacting said sample DNA with 
the isolated DNA of claim 1, said contacting being per- 
formed under conditions which allow hybridization of said 
sample DNA and said isolated DNA, and detecting any 
hybridization of said sample DNA and said isolated DNA, 
such that said hybridization of said sample DNA and said 
isolated DNA specifically indicates the presence of said Nm 
or Ng in said sample, 

wherein said isolated DNA is specific to Nm and Ng and 
does not hybridize on a Southern blot to a DNA 
sequence of Neisseria lactamica (Nl) strain N18064 
under the following hybridization conditions: 18 h at 
65° C, with a solution comprising 0.5 M NaP04 pH 
7.2, 0.001 M EDTA-Na, 1% bovine serum albumin and 
7% sodium dodecylsulphate, followed by at least two 
washes in a solution comprising 40 mM Na PO4 pH 
7.2, 1 mM EDTA, and 1% SDS, the final wash being 
conducted at 65° C. for 5 minutes, and 

wherein said isolated DNA specifically hybridizes to Nm 
DNA and Ng DNA in the presence of Nl DNA. 

6. A method of detecting Neisseria meningitidis (Nm) or 
Neisseria gonorrheae (Ng) in a sample from a patient, said 
method comprising providing isolated sample DNA of said 55 
sample from the patient, contacting said sample DNA with 
the composition of claim 1 under conditions which a low 
hybridization of said sample DNA and DNA in said 
composition, and detecting any hybridization of said sample 
DNA and said DNA in said composition, such that said 50 
hybridization of said sample DNA and said DNA in said 
composition specifically indicates the presence of said Nm 

or Ng in said sample, 
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wherein said DNA in said composition is specific to Nm 
and Ng and does not hybridize on a Southern blot to a 
DNA sequence of Neisseria lactamica (Nl) strain 
N18064 under the following hybridization conditions: 
16 h at 65° C, with a solution comprising 0.5 M NaP04 
pH 7.2, 0.001 M EDTA-Na, 1% bovine serum albumin 
and 7% sodium dodecylsulphate, followed by at least 
two washes in a solution comprising 40 mM Na PO4 
pH 7.2, 1 mM EDTA, and 1% SDS, the final wash 
being conducted at 65° C. for 5 minutes, and 

wherein said DNA is said composition specifically 
hybridizes to Nm DNA and Ng DNA in the presence of 
NIDNA. 

7. A method of detecting Neisseria meningitidis (Nm) of 
Neisseria gonorrheae (Ng) in a sample from a patient, said 
method comprising providing isolated sample DNA of said 
sample from the patient, contacting said sample DNA with 
the isolated DNA of claim 3, said contacting being per- 
formed under conditions which allow hybridization of said 
sample DNA and said isolated DNA, and detecting any 
hybridization of said sample DNA and said isolated DNA, 
such that said hybridization of said sample DNA and said 
isolated DNA specifically indicates the presence of said Nm 
or Ng in said sample, 

wherein said isolated DNA is specific to Mm and Ng and 
does not hybridize on a Southern blot to a DNA 
sequence of Neisseria lactamica (Nl) strain N18064 
under the following hybridization conditions: 16 h at 
65° C, with a solution comprising 0.5 M NaP04 pH 
7.2, 0.001 M EDTA-Na, 1% bovine serum albumin and 
7% sodium dodecylsulphate, followed by at least two 
washes in a solution comprising 40 mM Na PO4PH 7.2, 
1 mM EDTA, and 1% SDS, the final wash being 
conducted at 65° C. for 5 minutes, and 

wherein said isolated DNA specifically hybridizes to Nm 
DNA and Ng DNA in the presence of Nl DNA. 

8. A method of detecting Neisseria meningitidis (Nm) or 
Neisseria gonorrheae (Ng) in a sample from a patient, said 
method comprising providing isolated sample DNA of said 
sample from the patient, contacting said sample DNA with 
the composition of claim 4 under conditions which allow 
hybridization of said sample DNA and DNA in said 
composition, and detecting any hybridization of said sample 
DNA and said DNA in said composition, such that said 
hybridization of said sample DNA and said DNA in said 
composition specifically indicates the presence of said Nm 
or Ng in said sample, 

wherein said DNA in said composition is specific to Nm 
and Ng and does not hybridize on a Southern blot to a 
DNA sequence of Neisseria lactamica (Nl) strain 
N18064 under the following hybridization conditions: 
16 h at 65° C, with a solution comprising 0.5 M NaP04 
pH 7.2, 0.001 M EDTA-Na, 1% bovine serum albumin 
and 7% sodium dodecylsulphate, followed by at least 
two washes in a solution comprising 40 mM Na PO4 
pH 7.2, 1 mM EDTA, and 1% SDS, the final wash 
being conducted at 65° C. for 5 minutes, and 

wherein said DNA is said composition specifically 
hybridizes to Nm DNA and Ng DNA in the presence of 
Nl DNA. 



